Overview
Brought to you by YData
Dataset statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Number of variables | 78 | 78 |
| Number of observations | 1000000 | 30000 |
| Missing cells | 0 | 0 |
| Missing cells (%) | 0.0% | 0.0% |
| Total size in memory | 595.1 MiB | 17.9 MiB |
| Average record size in memory | 624.0 B | 624.0 B |
Variable types
| Full Dataset | Stratified Sample | |
|---|---|---|
| Numeric | 40 | 40 |
| Text | 38 | 38 |
| Full Dataset | Stratified Sample | |
|---|---|---|
customer_id has unique values | customer_id has unique values | Unique |
membership_years has 99846 (10.0%) zeros | membership_years has 3030 (10.1%) zeros | Zeros |
number_of_children has 199753 (20.0%) zeros | number_of_children has 5930 (19.8%) zeros | Zeros |
transaction_hour has 41756 (4.2%) zeros | transaction_hour has 1277 (4.3%) zeros | Zeros |
avg_discount_used has 10010 (1.0%) zeros | Alert not present in this dataset | Zeros |
in_store_purchases has 10016 (1.0%) zeros | in_store_purchases has 321 (1.1%) zeros | Zeros |
total_returned_items has 100060 (10.0%) zeros | total_returned_items has 3026 (10.1%) zeros | Zeros |
product_stock has 10174 (1.0%) zeros | product_stock has 317 (1.1%) zeros | Zeros |
customer_support_calls has 49755 (5.0%) zeros | customer_support_calls has 1525 (5.1%) zeros | Zeros |
website_visits has 10111 (1.0%) zeros | Alert not present in this dataset | Zeros |
Reproduction
| Full Dataset | Stratified Sample | |
|---|---|---|
| Analysis started | 2025-06-06 02:26:31.335908 | 2025-06-06 02:28:30.874457 |
| Analysis finished | 2025-06-06 02:28:30.840000 | 2025-06-06 02:28:35.688336 |
| Duration | 1 minute and 59.5 seconds | 4.81 seconds |
| Software version | ydata-profiling vv4.16.1 | ydata-profiling vv4.16.1 |
| Download configuration | config.json | config.json |
Variables
customer_id
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 1000000 | 30000 |
| Distinct (%) | 100.0% | 100.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 500000.5 | 500803.1024 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 12 |
| Maximum | 1000000 | 999880 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 12 |
| 5-th percentile | 50000.95 | 51637.35 |
| Q1 | 250000.75 | 249894 |
| median | 500000.5 | 502067.5 |
| Q3 | 750000.25 | 752820.25 |
| 95-th percentile | 950000.05 | 947160.85 |
| Maximum | 1000000 | 999880 |
| Range | 999999 | 999868 |
| Interquartile range (IQR) | 499999.5 | 502926.25 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 288675.2789 | 288853.064 |
| Coefficient of variation (CV) | 0.5773499805 | 0.5767797018 |
| Kurtosis | -1.2 | -1.208484499 |
| Mean | 500000.5 | 500803.1024 |
| Median Absolute Deviation (MAD) | 250000 | 251514 |
| Skewness | -2.511790261 × 10-15 | -0.008815474411 |
| Sum | 5.000005 × 1011 | 1.502409307 × 1010 |
| Variance | 8.333341667 × 1010 | 8.343609261 × 1010 |
| Monotonicity | Strictly increasing | Not monotonic |
| Value | Count | Frequency (%) |
| 999984 | 1 | < 0.1% |
| 999983 | 1 | < 0.1% |
| 999982 | 1 | < 0.1% |
| 999981 | 1 | < 0.1% |
| 999980 | 1 | < 0.1% |
| 999979 | 1 | < 0.1% |
| 999978 | 1 | < 0.1% |
| 999977 | 1 | < 0.1% |
| 999976 | 1 | < 0.1% |
| 999975 | 1 | < 0.1% |
| Other values (999990) | 999990 |
| Value | Count | Frequency (%) |
| 162060 | 1 | < 0.1% |
| 962145 | 1 | < 0.1% |
| 988334 | 1 | < 0.1% |
| 703785 | 1 | < 0.1% |
| 729865 | 1 | < 0.1% |
| 944983 | 1 | < 0.1% |
| 259621 | 1 | < 0.1% |
| 507975 | 1 | < 0.1% |
| 5872 | 1 | < 0.1% |
| 403580 | 1 | < 0.1% |
| Other values (29990) | 29990 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 |
| Value | Count | Frequency (%) |
| 12 | 1 | |
| 18 | 1 | |
| 34 | 1 | |
| 125 | 1 | |
| 147 | 1 |
| Value | Count | Frequency (%) |
| 12 | 1 | |
| 18 | 1 | |
| 34 | 1 | |
| 125 | 1 | |
| 147 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 |
age
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 62 | 62 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 48.496605 | 48.33273333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 18 | 18 |
| Maximum | 79 | 79 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 18 | 18 |
| 5-th percentile | 21 | 21 |
| Q1 | 33 | 33 |
| median | 49 | 48 |
| Q3 | 64 | 64 |
| 95-th percentile | 76 | 76 |
| Maximum | 79 | 79 |
| Range | 61 | 61 |
| Interquartile range (IQR) | 31 | 31 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 17.87438116 | 17.89495728 |
| Coefficient of variation (CV) | 0.3685697414 | 0.3702450917 |
| Kurtosis | -1.198117884 | -1.195486737 |
| Mean | 48.496605 | 48.33273333 |
| Median Absolute Deviation (MAD) | 15 | 15 |
| Skewness | -0.0002769945754 | 0.009864896331 |
| Sum | 48496605 | 1449982 |
| Variance | 319.493502 | 320.2294962 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 53 | 16423 | 1.6% |
| 54 | 16412 | 1.6% |
| 33 | 16407 | 1.6% |
| 36 | 16363 | 1.6% |
| 62 | 16324 | 1.6% |
| 39 | 16290 | 1.6% |
| 34 | 16284 | 1.6% |
| 40 | 16274 | 1.6% |
| 32 | 16264 | 1.6% |
| 19 | 16248 | 1.6% |
| Other values (52) | 836711 |
| Value | Count | Frequency (%) |
| 20 | 539 | 1.8% |
| 78 | 536 | 1.8% |
| 36 | 533 | 1.8% |
| 62 | 527 | 1.8% |
| 37 | 523 | 1.7% |
| 57 | 516 | 1.7% |
| 25 | 516 | 1.7% |
| 30 | 514 | 1.7% |
| 58 | 514 | 1.7% |
| 21 | 508 | 1.7% |
| Other values (52) | 24774 |
| Value | Count | Frequency (%) |
| 18 | 16003 | |
| 19 | 16248 | |
| 20 | 16116 | |
| 21 | 16016 | |
| 22 | 16211 |
| Value | Count | Frequency (%) |
| 18 | 465 | |
| 19 | 490 | |
| 20 | 539 | |
| 21 | 508 | |
| 22 | 500 |
| Value | Count | Frequency (%) |
| 18 | 465 | |
| 19 | 490 | |
| 20 | 539 | |
| 21 | 508 | |
| 22 | 500 |
| Value | Count | Frequency (%) |
| 18 | 16003 | |
| 19 | 16248 | |
| 20 | 16116 | |
| 21 | 16016 | |
| 22 | 16211 |
gender
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 5 | 5 |
| Mean length | 5.001174 | 4.9948 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Other | Other |
| 2nd row | Female | Female |
| 3rd row | Female | Other |
| 4th row | Female | Male |
| 5th row | Female | Female |
| Value | Count | Frequency (%) |
| other | 333734 | |
| female | 333720 | |
| male | 332546 |
| Value | Count | Frequency (%) |
| male | 10085 | |
| other | 9986 | |
| female | 9929 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39929 | |
| a | 20014 | |
| l | 20014 | |
| M | 10085 | 6.7% |
| O | 9986 | 6.7% |
| t | 9986 | 6.7% |
| h | 9986 | 6.7% |
| r | 9986 | 6.7% |
| F | 9929 | 6.6% |
| m | 9929 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5001174 |
| Value | Count | Frequency (%) |
| (unknown) | 149844 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39929 | |
| a | 20014 | |
| l | 20014 | |
| M | 10085 | 6.7% |
| O | 9986 | 6.7% |
| t | 9986 | 6.7% |
| h | 9986 | 6.7% |
| r | 9986 | 6.7% |
| F | 9929 | 6.6% |
| m | 9929 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5001174 |
| Value | Count | Frequency (%) |
| (unknown) | 149844 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39929 | |
| a | 20014 | |
| l | 20014 | |
| M | 10085 | 6.7% |
| O | 9986 | 6.7% |
| t | 9986 | 6.7% |
| h | 9986 | 6.7% |
| r | 9986 | 6.7% |
| F | 9929 | 6.6% |
| m | 9929 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5001174 |
| Value | Count | Frequency (%) |
| (unknown) | 149844 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39929 | |
| a | 20014 | |
| l | 20014 | |
| M | 10085 | 6.7% |
| O | 9986 | 6.7% |
| t | 9986 | 6.7% |
| h | 9986 | 6.7% |
| r | 9986 | 6.7% |
| F | 9929 | 6.6% |
| m | 9929 | 6.6% |
income_bracket
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.333713 | 4.3337 |
| Min length | 3 | 3 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | High | High |
| 2nd row | Medium | Low |
| 3rd row | Low | Low |
| 4th row | Low | Medium |
| 5th row | Low | High |
| Value | Count | Frequency (%) |
| high | 333612 | |
| medium | 333367 | |
| low | 333021 |
| Value | Count | Frequency (%) |
| high | 10008 | |
| medium | 10001 | |
| low | 9991 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20009 | |
| H | 10008 | |
| g | 10008 | |
| h | 10008 | |
| M | 10001 | |
| e | 10001 | |
| d | 10001 | |
| u | 10001 | |
| m | 10001 | |
| L | 9991 | |
| Other values (2) | 19982 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4333713 |
| Value | Count | Frequency (%) |
| (unknown) | 130011 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20009 | |
| H | 10008 | |
| g | 10008 | |
| h | 10008 | |
| M | 10001 | |
| e | 10001 | |
| d | 10001 | |
| u | 10001 | |
| m | 10001 | |
| L | 9991 | |
| Other values (2) | 19982 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4333713 |
| Value | Count | Frequency (%) |
| (unknown) | 130011 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20009 | |
| H | 10008 | |
| g | 10008 | |
| h | 10008 | |
| M | 10001 | |
| e | 10001 | |
| d | 10001 | |
| u | 10001 | |
| m | 10001 | |
| L | 9991 | |
| Other values (2) | 19982 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4333713 |
| Value | Count | Frequency (%) |
| (unknown) | 130011 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20009 | |
| H | 10008 | |
| g | 10008 | |
| h | 10008 | |
| M | 10001 | |
| e | 10001 | |
| d | 10001 | |
| u | 10001 | |
| m | 10001 | |
| L | 9991 | |
| Other values (2) | 19982 |
loyalty_program
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 2 |
| Mean length | 2.499712 | 2.499666667 |
| Min length | 2 | 2 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | No | No |
| 2nd row | No | No |
| 3rd row | No | Yes |
| 4th row | No | No |
| 5th row | Yes | No |
| Value | Count | Frequency (%) |
| no | 500288 | |
| yes | 499712 |
| Value | Count | Frequency (%) |
| no | 15010 | |
| yes | 14990 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15010 | |
| o | 15010 | |
| Y | 14990 | |
| e | 14990 | |
| s | 14990 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499712 |
| Value | Count | Frequency (%) |
| (unknown) | 74990 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15010 | |
| o | 15010 | |
| Y | 14990 | |
| e | 14990 | |
| s | 14990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499712 |
| Value | Count | Frequency (%) |
| (unknown) | 74990 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15010 | |
| o | 15010 | |
| Y | 14990 | |
| e | 14990 | |
| s | 14990 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499712 |
| Value | Count | Frequency (%) |
| (unknown) | 74990 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15010 | |
| o | 15010 | |
| Y | 14990 | |
| e | 14990 | |
| s | 14990 |
membership_years
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 10 | 10 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 4.497453 | 4.512466667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 9 | 9 |
| Zeros | 99846 | 3030 |
| Zeros (%) | 10.0% | 10.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0 | 0 |
| Q1 | 2 | 2 |
| median | 4 | 5 |
| Q3 | 7 | 7 |
| 95-th percentile | 9 | 9 |
| Maximum | 9 | 9 |
| Range | 9 | 9 |
| Interquartile range (IQR) | 5 | 5 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2.872405571 | 2.876847891 |
| Coefficient of variation (CV) | 0.6386738385 | 0.637533328 |
| Kurtosis | -1.22454665 | -1.222483107 |
| Mean | 4.497453 | 4.512466667 |
| Median Absolute Deviation (MAD) | 3 | 2 |
| Skewness | 0.001590463324 | -0.009342021881 |
| Sum | 4497453 | 135374 |
| Variance | 8.250713764 | 8.276253791 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 100686 | |
| 5 | 100183 | |
| 4 | 100137 | |
| 9 | 99977 | |
| 2 | 99964 | |
| 8 | 99891 | |
| 6 | 99865 | |
| 0 | 99846 | |
| 7 | 99728 | |
| 3 | 99723 |
| Value | Count | Frequency (%) |
| 5 | 3096 | |
| 9 | 3060 | |
| 0 | 3030 | |
| 6 | 3021 | |
| 7 | 3020 | |
| 3 | 3008 | |
| 1 | 2998 | |
| 8 | 2951 | |
| 4 | 2913 | |
| 2 | 2903 |
| Value | Count | Frequency (%) |
| 0 | 99846 | |
| 1 | 100686 | |
| 2 | 99964 | |
| 3 | 99723 | |
| 4 | 100137 |
| Value | Count | Frequency (%) |
| 0 | 3030 | |
| 1 | 2998 | |
| 2 | 2903 | |
| 3 | 3008 | |
| 4 | 2913 |
| Value | Count | Frequency (%) |
| 0 | 3030 | |
| 1 | 2998 | |
| 2 | 2903 | |
| 3 | 3008 | |
| 4 | 2913 |
| Value | Count | Frequency (%) |
| 0 | 99846 | |
| 1 | 100686 | |
| 2 | 99964 | |
| 3 | 99723 | |
| 4 | 100137 |
churned
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 2 |
| Mean length | 2.499729 | 2.496266667 |
| Min length | 2 | 2 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | No | Yes |
| 2nd row | No | Yes |
| 3rd row | No | No |
| 4th row | No | No |
| 5th row | Yes | Yes |
| Value | Count | Frequency (%) |
| no | 500271 | |
| yes | 499729 |
| Value | Count | Frequency (%) |
| no | 15112 | |
| yes | 14888 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15112 | |
| o | 15112 | |
| Y | 14888 | |
| e | 14888 | |
| s | 14888 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499729 |
| Value | Count | Frequency (%) |
| (unknown) | 74888 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15112 | |
| o | 15112 | |
| Y | 14888 | |
| e | 14888 | |
| s | 14888 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499729 |
| Value | Count | Frequency (%) |
| (unknown) | 74888 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15112 | |
| o | 15112 | |
| Y | 14888 | |
| e | 14888 | |
| s | 14888 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499729 |
| Value | Count | Frequency (%) |
| (unknown) | 74888 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15112 | |
| o | 15112 | |
| Y | 14888 | |
| e | 14888 | |
| s | 14888 |
marital_status
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 8 | 8 |
| Median length | 7 | 7 |
| Mean length | 7.000866 | 7.001233333 |
| Min length | 6 | 6 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Divorced | Divorced |
| 2nd row | Married | Divorced |
| 3rd row | Married | Divorced |
| 4th row | Divorced | Married |
| 5th row | Divorced | Married |
| Value | Count | Frequency (%) |
| divorced | 333816 | |
| married | 333234 | |
| single | 332950 |
| Value | Count | Frequency (%) |
| married | 10067 | |
| divorced | 9985 | |
| single | 9948 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30119 | |
| i | 30000 | |
| e | 30000 | |
| d | 20052 | |
| a | 10067 | 4.8% |
| M | 10067 | 4.8% |
| D | 9985 | 4.8% |
| v | 9985 | 4.8% |
| o | 9985 | 4.8% |
| c | 9985 | 4.8% |
| Other values (4) | 39792 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000866 |
| Value | Count | Frequency (%) |
| (unknown) | 210037 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30119 | |
| i | 30000 | |
| e | 30000 | |
| d | 20052 | |
| a | 10067 | 4.8% |
| M | 10067 | 4.8% |
| D | 9985 | 4.8% |
| v | 9985 | 4.8% |
| o | 9985 | 4.8% |
| c | 9985 | 4.8% |
| Other values (4) | 39792 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000866 |
| Value | Count | Frequency (%) |
| (unknown) | 210037 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30119 | |
| i | 30000 | |
| e | 30000 | |
| d | 20052 | |
| a | 10067 | 4.8% |
| M | 10067 | 4.8% |
| D | 9985 | 4.8% |
| v | 9985 | 4.8% |
| o | 9985 | 4.8% |
| c | 9985 | 4.8% |
| Other values (4) | 39792 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000866 |
| Value | Count | Frequency (%) |
| (unknown) | 210037 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30119 | |
| i | 30000 | |
| e | 30000 | |
| d | 20052 | |
| a | 10067 | 4.8% |
| M | 10067 | 4.8% |
| D | 9985 | 4.8% |
| v | 9985 | 4.8% |
| o | 9985 | 4.8% |
| c | 9985 | 4.8% |
| Other values (4) | 39792 |
number_of_children
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 2.000554 | 1.998133333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 4 | 4 |
| Zeros | 199753 | 5930 |
| Zeros (%) | 20.0% | 19.8% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0 | 0 |
| Q1 | 1 | 1 |
| median | 2 | 2 |
| Q3 | 3 | 3 |
| 95-th percentile | 4 | 4 |
| Maximum | 4 | 4 |
| Range | 4 | 4 |
| Interquartile range (IQR) | 2 | 2 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 1.414214161 | 1.407644332 |
| Coefficient of variation (CV) | 0.7069112661 | 0.7044796802 |
| Kurtosis | -1.300270709 | -1.290005138 |
| Mean | 2.000554 | 1.998133333 |
| Median Absolute Deviation (MAD) | 1 | 1 |
| Skewness | -0.0001223295646 | 0.001157855148 |
| Sum | 2000554 | 59944 |
| Variance | 2.000001693 | 1.981462564 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 200307 | |
| 4 | 200157 | |
| 3 | 200053 | |
| 0 | 199753 | |
| 2 | 199730 |
| Value | Count | Frequency (%) |
| 3 | 6063 | |
| 1 | 6059 | |
| 2 | 6048 | |
| 0 | 5930 | |
| 4 | 5900 |
| Value | Count | Frequency (%) |
| 0 | 199753 | |
| 1 | 200307 | |
| 2 | 199730 | |
| 3 | 200053 | |
| 4 | 200157 |
| Value | Count | Frequency (%) |
| 0 | 5930 | |
| 1 | 6059 | |
| 2 | 6048 | |
| 3 | 6063 | |
| 4 | 5900 |
| Value | Count | Frequency (%) |
| 0 | 5930 | |
| 1 | 6059 | |
| 2 | 6048 | |
| 3 | 6063 | |
| 4 | 5900 |
| Value | Count | Frequency (%) |
| 0 | 199753 | |
| 1 | 200307 | |
| 2 | 199730 | |
| 3 | 200053 | |
| 4 | 200157 |
education_level
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 11 | 11 |
| Median length | 10 | 10 |
| Mean length | 8.00064 | 8.017833333 |
| Min length | 3 | 3 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Bachelor's | Bachelor's |
| 2nd row | PhD | High School |
| 3rd row | Bachelor's | PhD |
| 4th row | Master's | Bachelor's |
| 5th row | Bachelor's | Master's |
| Value | Count | Frequency (%) |
| bachelor's | 250360 | |
| high | 250105 | |
| school | 250105 | |
| phd | 250079 | |
| master's | 249456 |
| Value | Count | Frequency (%) |
| high | 7575 | |
| school | 7575 | |
| bachelor's | 7565 | |
| phd | 7464 | |
| master's | 7396 |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 30179 | |
| o | 22715 | 9.4% |
| s | 22357 | 9.3% |
| c | 15140 | 6.3% |
| l | 15140 | 6.3% |
| e | 14961 | 6.2% |
| a | 14961 | 6.2% |
| ' | 14961 | 6.2% |
| r | 14961 | 6.2% |
| H | 7575 | 3.1% |
| Other values (9) | 67585 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8000640 |
| Value | Count | Frequency (%) |
| (unknown) | 240535 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 30179 | |
| o | 22715 | 9.4% |
| s | 22357 | 9.3% |
| c | 15140 | 6.3% |
| l | 15140 | 6.3% |
| e | 14961 | 6.2% |
| a | 14961 | 6.2% |
| ' | 14961 | 6.2% |
| r | 14961 | 6.2% |
| H | 7575 | 3.1% |
| Other values (9) | 67585 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8000640 |
| Value | Count | Frequency (%) |
| (unknown) | 240535 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 30179 | |
| o | 22715 | 9.4% |
| s | 22357 | 9.3% |
| c | 15140 | 6.3% |
| l | 15140 | 6.3% |
| e | 14961 | 6.2% |
| a | 14961 | 6.2% |
| ' | 14961 | 6.2% |
| r | 14961 | 6.2% |
| H | 7575 | 3.1% |
| Other values (9) | 67585 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8000640 |
| Value | Count | Frequency (%) |
| (unknown) | 240535 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 30179 | |
| o | 22715 | 9.4% |
| s | 22357 | 9.3% |
| c | 15140 | 6.3% |
| l | 15140 | 6.3% |
| e | 14961 | 6.2% |
| a | 14961 | 6.2% |
| ' | 14961 | 6.2% |
| r | 14961 | 6.2% |
| H | 7575 | 3.1% |
| Other values (9) | 67585 |
occupation
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 13 | 13 |
| Median length | 10 | 10 |
| Mean length | 9.500854 | 9.497766667 |
| Min length | 7 | 7 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Self-Employed | Retired |
| 2nd row | Unemployed | Retired |
| 3rd row | Self-Employed | Employed |
| 4th row | Employed | Self-Employed |
| 5th row | Employed | Employed |
| Value | Count | Frequency (%) |
| employed | 250857 | |
| unemployed | 250117 | |
| self-employed | 249941 | |
| retired | 249085 |
| Value | Count | Frequency (%) |
| retired | 7545 | |
| self-employed | 7534 | |
| employed | 7517 | |
| unemployed | 7404 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52483 | |
| d | 30000 | |
| l | 29989 | |
| o | 22455 | |
| p | 22455 | |
| m | 22455 | |
| y | 22455 | |
| E | 15051 | 5.3% |
| R | 7545 | 2.6% |
| t | 7545 | 2.6% |
| Other values (7) | 52500 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9500854 |
| Value | Count | Frequency (%) |
| (unknown) | 284933 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52483 | |
| d | 30000 | |
| l | 29989 | |
| o | 22455 | |
| p | 22455 | |
| m | 22455 | |
| y | 22455 | |
| E | 15051 | 5.3% |
| R | 7545 | 2.6% |
| t | 7545 | 2.6% |
| Other values (7) | 52500 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9500854 |
| Value | Count | Frequency (%) |
| (unknown) | 284933 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52483 | |
| d | 30000 | |
| l | 29989 | |
| o | 22455 | |
| p | 22455 | |
| m | 22455 | |
| y | 22455 | |
| E | 15051 | 5.3% |
| R | 7545 | 2.6% |
| t | 7545 | 2.6% |
| Other values (7) | 52500 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9500854 |
| Value | Count | Frequency (%) |
| (unknown) | 284933 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52483 | |
| d | 30000 | |
| l | 29989 | |
| o | 22455 | |
| p | 22455 | |
| m | 22455 | |
| y | 22455 | |
| E | 15051 | 5.3% |
| R | 7545 | 2.6% |
| t | 7545 | 2.6% |
| Other values (7) | 52500 |
transaction_id
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 632576 | 29538 |
| Distinct (%) | 63.3% | 98.5% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499891.7314 | 502817.3992 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 2 | 30 |
| Maximum | 999999 | 999881 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 2 | 30 |
| 5-th percentile | 50200.95 | 51972.95 |
| Q1 | 249878.75 | 255453.25 |
| median | 499559.5 | 502048.5 |
| Q3 | 750071.25 | 751640.75 |
| 95-th percentile | 950045.2 | 950411.5 |
| Maximum | 999999 | 999881 |
| Range | 999997 | 999851 |
| Interquartile range (IQR) | 500192.5 | 496187.5 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 288706.0577 | 287335.4613 |
| Coefficient of variation (CV) | 0.5775371735 | 0.5714509119 |
| Kurtosis | -1.200114605 | -1.184605739 |
| Mean | 499891.7314 | 502817.3992 |
| Median Absolute Deviation (MAD) | 250088.5 | 248150.5 |
| Skewness | 0.002395187253 | -0.009883747709 |
| Sum | 4.998917314 × 1011 | 1.508452198 × 1010 |
| Variance | 8.335118772 × 1010 | 8.256166732 × 1010 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 115913 | 9 | < 0.1% |
| 504562 | 8 | < 0.1% |
| 344167 | 8 | < 0.1% |
| 2773 | 8 | < 0.1% |
| 239407 | 8 | < 0.1% |
| 620816 | 8 | < 0.1% |
| 273197 | 8 | < 0.1% |
| 254678 | 7 | < 0.1% |
| 798940 | 7 | < 0.1% |
| 335691 | 7 | < 0.1% |
| Other values (632566) | 999922 |
| Value | Count | Frequency (%) |
| 491609 | 3 | < 0.1% |
| 6554 | 3 | < 0.1% |
| 318501 | 3 | < 0.1% |
| 346938 | 3 | < 0.1% |
| 832355 | 3 | < 0.1% |
| 974493 | 2 | < 0.1% |
| 233193 | 2 | < 0.1% |
| 108085 | 2 | < 0.1% |
| 496964 | 2 | < 0.1% |
| 616078 | 2 | < 0.1% |
| Other values (29528) | 29975 |
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 3 | 1 | < 0.1% |
| 5 | 3 | |
| 6 | 1 | < 0.1% |
| 7 | 2 |
| Value | Count | Frequency (%) |
| 30 | 1 | |
| 36 | 1 | |
| 55 | 1 | |
| 63 | 1 | |
| 69 | 1 |
| Value | Count | Frequency (%) |
| 30 | 1 | |
| 36 | 1 | |
| 55 | 1 | |
| 63 | 1 | |
| 69 | 1 |
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 3 | 1 | < 0.1% |
| 5 | 3 | |
| 6 | 1 | < 0.1% |
| 7 | 2 |
transaction_date
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 992231 | 29990 |
| Distinct (%) | 99.2% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 984504 | 29980 ? |
| Unique (%) | 98.5% | 99.9% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | 2020-10-11 10:08:52 | 2021-01-28 14:56:18 |
| 2nd row | 2021-12-08 01:07:40 | 2021-08-05 22:27:14 |
| 3rd row | 2020-02-17 09:40:48 | 2020-11-29 01:27:56 |
| 4th row | 2020-08-13 00:43:14 | 2020-10-20 00:00:36 |
| 5th row | 2021-07-02 11:59:03 | 2021-01-01 15:53:55 |
| Value | Count | Frequency (%) |
| 2020-10-05 | 1509 | 0.1% |
| 2020-09-06 | 1467 | 0.1% |
| 2020-10-04 | 1464 | 0.1% |
| 2020-07-26 | 1463 | 0.1% |
| 2020-02-26 | 1458 | 0.1% |
| 2020-05-03 | 1455 | 0.1% |
| 2021-02-27 | 1453 | 0.1% |
| 2021-07-30 | 1451 | 0.1% |
| 2020-09-07 | 1451 | 0.1% |
| 2020-10-09 | 1447 | 0.1% |
| Other values (87119) | 1985382 |
| Value | Count | Frequency (%) |
| 2021-08-13 | 61 | 0.1% |
| 2020-10-17 | 60 | 0.1% |
| 2020-11-27 | 60 | 0.1% |
| 2021-11-09 | 60 | 0.1% |
| 2021-11-24 | 59 | 0.1% |
| 2021-06-25 | 57 | 0.1% |
| 2021-08-24 | 57 | 0.1% |
| 2020-09-11 | 56 | 0.1% |
| 2021-04-23 | 56 | 0.1% |
| 2021-12-15 | 56 | 0.1% |
| Other values (26058) | 59418 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 113957 | |
| 2 | 102327 | |
| 1 | 73132 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26926 | 4.7% |
| 5 | 23976 | 4.2% |
| 4 | 23960 | 4.2% |
| 7 | 14065 | 2.5% |
| Other values (3) | 41657 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 113957 | |
| 2 | 102327 | |
| 1 | 73132 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26926 | 4.7% |
| 5 | 23976 | 4.2% |
| 4 | 23960 | 4.2% |
| 7 | 14065 | 2.5% |
| Other values (3) | 41657 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 113957 | |
| 2 | 102327 | |
| 1 | 73132 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26926 | 4.7% |
| 5 | 23976 | 4.2% |
| 4 | 23960 | 4.2% |
| 7 | 14065 | 2.5% |
| Other values (3) | 41657 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 113957 | |
| 2 | 102327 | |
| 1 | 73132 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26926 | 4.7% |
| 5 | 23976 | 4.2% |
| 4 | 23960 | 4.2% |
| 7 | 14065 | 2.5% |
| Other values (3) | 41657 | 7.3% |
product_id
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 9999 | 9507 |
| Distinct (%) | 1.0% | 31.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 4999.564515 | 5014.683167 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 9999 | 9999 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 500 | 510 |
| Q1 | 2498 | 2528 |
| median | 4999 | 5026.5 |
| Q3 | 7498 | 7510.25 |
| 95-th percentile | 9499 | 9490.05 |
| Maximum | 9999 | 9999 |
| Range | 9998 | 9998 |
| Interquartile range (IQR) | 5000 | 4982.25 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2886.798391 | 2882.480717 |
| Coefficient of variation (CV) | 0.5774099689 | 0.5748081426 |
| Kurtosis | -1.200144352 | -1.197819312 |
| Mean | 4999.564515 | 5014.683167 |
| Median Absolute Deviation (MAD) | 2500 | 2490.5 |
| Skewness | 0.0002346107222 | -0.00782393366 |
| Sum | 4999564515 | 150440495 |
| Variance | 8333604.95 | 8308695.082 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 4898 | 145 | < 0.1% |
| 51 | 143 | < 0.1% |
| 9593 | 141 | < 0.1% |
| 5427 | 138 | < 0.1% |
| 3923 | 137 | < 0.1% |
| 8365 | 135 | < 0.1% |
| 4541 | 134 | < 0.1% |
| 2590 | 134 | < 0.1% |
| 467 | 133 | < 0.1% |
| 3676 | 133 | < 0.1% |
| Other values (9989) | 998627 |
| Value | Count | Frequency (%) |
| 3589 | 10 | < 0.1% |
| 1491 | 10 | < 0.1% |
| 2635 | 10 | < 0.1% |
| 4705 | 10 | < 0.1% |
| 6613 | 10 | < 0.1% |
| 5665 | 10 | < 0.1% |
| 6931 | 9 | < 0.1% |
| 5435 | 9 | < 0.1% |
| 278 | 9 | < 0.1% |
| 5777 | 9 | < 0.1% |
| Other values (9497) | 29904 |
| Value | Count | Frequency (%) |
| 1 | 92 | |
| 2 | 107 | |
| 3 | 117 | |
| 4 | 97 | |
| 5 | 92 |
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | 4 | |
| 3 | 3 | |
| 4 | 3 | |
| 5 | 3 |
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | 4 | |
| 3 | 3 | |
| 4 | 3 | |
| 5 | 3 |
| Value | Count | Frequency (%) |
| 1 | 92 | |
| 2 | 107 | |
| 3 | 117 | |
| 4 | 97 | |
| 5 | 92 |
product_category
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 11 | 11 |
| Median length | 9 | 9 |
| Mean length | 8.196389 | 8.208833333 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Electronics | Clothing |
| 2nd row | Groceries | Electronics |
| 3rd row | Toys | Groceries |
| 4th row | Toys | Furniture |
| 5th row | Clothing | Furniture |
| Value | Count | Frequency (%) |
| toys | 200669 | |
| groceries | 200214 | |
| clothing | 199778 | |
| electronics | 199756 | |
| furniture | 199583 |
| Value | Count | Frequency (%) |
| groceries | 6036 | |
| furniture | 6016 | |
| clothing | 6010 | |
| electronics | 5995 | |
| toys | 5943 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30099 | |
| e | 24083 | |
| i | 24057 | |
| o | 23984 | |
| c | 18026 | 7.3% |
| n | 18021 | 7.3% |
| t | 18021 | 7.3% |
| s | 17974 | 7.3% |
| u | 12032 | 4.9% |
| l | 12005 | 4.9% |
| Other values (8) | 47963 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8196389 |
| Value | Count | Frequency (%) |
| (unknown) | 246265 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30099 | |
| e | 24083 | |
| i | 24057 | |
| o | 23984 | |
| c | 18026 | 7.3% |
| n | 18021 | 7.3% |
| t | 18021 | 7.3% |
| s | 17974 | 7.3% |
| u | 12032 | 4.9% |
| l | 12005 | 4.9% |
| Other values (8) | 47963 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8196389 |
| Value | Count | Frequency (%) |
| (unknown) | 246265 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30099 | |
| e | 24083 | |
| i | 24057 | |
| o | 23984 | |
| c | 18026 | 7.3% |
| n | 18021 | 7.3% |
| t | 18021 | 7.3% |
| s | 17974 | 7.3% |
| u | 12032 | 4.9% |
| l | 12005 | 4.9% |
| Other values (8) | 47963 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8196389 |
| Value | Count | Frequency (%) |
| (unknown) | 246265 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30099 | |
| e | 24083 | |
| i | 24057 | |
| o | 23984 | |
| c | 18026 | 7.3% |
| n | 18021 | 7.3% |
| t | 18021 | 7.3% |
| s | 17974 | 7.3% |
| u | 12032 | 4.9% |
| l | 12005 | 4.9% |
| Other values (8) | 47963 |
quantity
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 9 | 9 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.002649 | 5.016133333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 9 | 9 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1 | 1 |
| Q1 | 3 | 3 |
| median | 5 | 5 |
| Q3 | 7 | 7 |
| 95-th percentile | 9 | 9 |
| Maximum | 9 | 9 |
| Range | 8 | 8 |
| Interquartile range (IQR) | 4 | 4 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2.583751276 | 2.584742878 |
| Coefficient of variation (CV) | 0.516476626 | 0.5152859197 |
| Kurtosis | -1.231080652 | -1.230965354 |
| Mean | 5.002649 | 5.016133333 |
| Median Absolute Deviation (MAD) | 2 | 2 |
| Skewness | -0.0003647460673 | -0.007367687857 |
| Sum | 5002649 | 150484 |
| Variance | 6.675770659 | 6.680895745 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 111914 | |
| 3 | 111422 | |
| 7 | 111274 | |
| 1 | 111150 | |
| 4 | 111104 | |
| 6 | 111098 | |
| 2 | 110782 | |
| 8 | 110747 | |
| 5 | 110509 |
| Value | Count | Frequency (%) |
| 9 | 3383 | |
| 7 | 3364 | |
| 8 | 3354 | |
| 4 | 3344 | |
| 5 | 3342 | |
| 1 | 3327 | |
| 3 | 3316 | |
| 6 | 3289 | |
| 2 | 3281 |
| Value | Count | Frequency (%) |
| 1 | 111150 | |
| 2 | 110782 | |
| 3 | 111422 | |
| 4 | 111104 | |
| 5 | 110509 |
| Value | Count | Frequency (%) |
| 1 | 3327 | |
| 2 | 3281 | |
| 3 | 3316 | |
| 4 | 3344 | |
| 5 | 3342 |
| Value | Count | Frequency (%) |
| 1 | 3327 | |
| 2 | 3281 | |
| 3 | 3316 | |
| 4 | 3344 | |
| 5 | 3342 |
| Value | Count | Frequency (%) |
| 1 | 111150 | |
| 2 | 110782 | |
| 3 | 111422 | |
| 4 | 111104 | |
| 5 | 110509 |
unit_price
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 99896 | 25919 |
| Distinct (%) | 10.0% | 86.4% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 500.2613169 | 499.671594 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 1000 | 999.96 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 50.72 | 50.889 |
| Q1 | 250.31 | 248.755 |
| median | 500.41 | 498.76 |
| Q3 | 750.16 | 751.6825 |
| 95-th percentile | 949.91 | 949.4015 |
| Maximum | 1000 | 999.96 |
| Range | 999 | 998.96 |
| Interquartile range (IQR) | 499.85 | 502.9275 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 288.4628596 | 288.6536702 |
| Coefficient of variation (CV) | 0.5766243559 | 0.5776867719 |
| Kurtosis | -1.20144233 | -1.198680323 |
| Mean | 500.2613169 | 499.671594 |
| Median Absolute Deviation (MAD) | 249.93 | 251.41 |
| Skewness | -1.097330655 × 10-5 | 0.00333989629 |
| Sum | 500261316.9 | 14990147.82 |
| Variance | 83210.82139 | 83320.94129 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 226.51 | 28 | < 0.1% |
| 450.02 | 26 | < 0.1% |
| 591.8 | 25 | < 0.1% |
| 921.47 | 25 | < 0.1% |
| 354.83 | 25 | < 0.1% |
| 49.69 | 25 | < 0.1% |
| 111.41 | 24 | < 0.1% |
| 954.1 | 24 | < 0.1% |
| 619.19 | 24 | < 0.1% |
| 845.21 | 24 | < 0.1% |
| Other values (99886) | 999750 |
| Value | Count | Frequency (%) |
| 998.99 | 5 | < 0.1% |
| 640.48 | 5 | < 0.1% |
| 697.24 | 4 | < 0.1% |
| 530.24 | 4 | < 0.1% |
| 742.95 | 4 | < 0.1% |
| 855.69 | 4 | < 0.1% |
| 729.07 | 4 | < 0.1% |
| 765.64 | 4 | < 0.1% |
| 119.68 | 4 | < 0.1% |
| 720.62 | 4 | < 0.1% |
| Other values (25909) | 29958 |
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 1.01 | 9 | |
| 1.02 | 11 | |
| 1.03 | 8 | |
| 1.04 | 17 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 1.03 | 1 | |
| 1.04 | 1 | |
| 1.06 | 1 | |
| 1.08 | 2 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 1.03 | 1 | |
| 1.04 | 1 | |
| 1.06 | 1 | |
| 1.08 | 2 |
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 1.01 | 9 | |
| 1.02 | 11 | |
| 1.03 | 8 | |
| 1.04 | 17 |
discount_applied
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 51 | 51 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.24991049 | 0.2490266667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 0.5 | 0.5 |
| Zeros | 9967 | 280 |
| Zeros (%) | 1.0% | 0.9% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.03 | 0.03 |
| Q1 | 0.13 | 0.12 |
| median | 0.25 | 0.25 |
| Q3 | 0.37 | 0.37 |
| 95-th percentile | 0.47 | 0.47 |
| Maximum | 0.5 | 0.5 |
| Range | 0.5 | 0.5 |
| Interquartile range (IQR) | 0.24 | 0.25 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 0.1443279083 | 0.1439335605 |
| Coefficient of variation (CV) | 0.5775184079 | 0.5779845286 |
| Kurtosis | -1.19713108 | -1.195219351 |
| Mean | 0.24991049 | 0.2490266667 |
| Median Absolute Deviation (MAD) | 0.12 | 0.12 |
| Skewness | 0.0002640976336 | 0.01064049997 |
| Sum | 249910.49 | 7470.8 |
| Variance | 0.02083054512 | 0.02071686985 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0.19 | 20302 | 2.0% |
| 0.06 | 20213 | 2.0% |
| 0.34 | 20211 | 2.0% |
| 0.03 | 20207 | 2.0% |
| 0.05 | 20207 | 2.0% |
| 0.21 | 20199 | 2.0% |
| 0.29 | 20155 | 2.0% |
| 0.07 | 20153 | 2.0% |
| 0.43 | 20145 | 2.0% |
| 0.18 | 20111 | 2.0% |
| Other values (41) | 798097 |
| Value | Count | Frequency (%) |
| 0.07 | 658 | 2.2% |
| 0.4 | 658 | 2.2% |
| 0.12 | 645 | 2.1% |
| 0.39 | 632 | 2.1% |
| 0.03 | 631 | 2.1% |
| 0.37 | 628 | 2.1% |
| 0.13 | 628 | 2.1% |
| 0.24 | 627 | 2.1% |
| 0.15 | 626 | 2.1% |
| 0.19 | 625 | 2.1% |
| Other values (41) | 23642 |
| Value | Count | Frequency (%) |
| 0 | 9967 | |
| 0.01 | 20018 | |
| 0.02 | 19788 | |
| 0.03 | 20207 | |
| 0.04 | 19947 |
| Value | Count | Frequency (%) |
| 0 | 280 | |
| 0.01 | 570 | |
| 0.02 | 613 | |
| 0.03 | 631 | |
| 0.04 | 594 |
| Value | Count | Frequency (%) |
| 0 | 280 | |
| 0.01 | 570 | |
| 0.02 | 613 | |
| 0.03 | 631 | |
| 0.04 | 594 |
| Value | Count | Frequency (%) |
| 0 | 9967 | |
| 0.01 | 20018 | |
| 0.02 | 19788 | |
| 0.03 | 20207 | |
| 0.04 | 19947 |
payment_method
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 14 | 14 |
| Median length | 11 | 11 |
| Mean length | 9.751935 | 9.7828 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Credit Card | Debit Card |
| 2nd row | Credit Card | Debit Card |
| 3rd row | Debit Card | Debit Card |
| 4th row | Credit Card | Debit Card |
| 5th row | Mobile Payment | Debit Card |
| Value | Count | Frequency (%) |
| card | 500200 | |
| credit | 250435 | |
| mobile | 250030 | |
| payment | 250030 | |
| cash | 249770 | |
| debit | 249765 |
| Value | Count | Frequency (%) |
| card | 14999 | |
| debit | 7619 | |
| mobile | 7611 | |
| payment | 7611 | |
| cash | 7390 | |
| credit | 7380 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| 750230 | 7.7% | |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30221 | |
| a | 30000 | |
| C | 29769 | |
| 22610 | 7.7% | |
| i | 22610 | 7.7% |
| t | 22610 | 7.7% |
| r | 22379 | 7.6% |
| d | 22379 | 7.6% |
| b | 15230 | 5.2% |
| D | 7619 | 2.6% |
| Other values (9) | 68057 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9751935 |
| Value | Count | Frequency (%) |
| (unknown) | 293484 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| 750230 | 7.7% | |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30221 | |
| a | 30000 | |
| C | 29769 | |
| 22610 | 7.7% | |
| i | 22610 | 7.7% |
| t | 22610 | 7.7% |
| r | 22379 | 7.6% |
| d | 22379 | 7.6% |
| b | 15230 | 5.2% |
| D | 7619 | 2.6% |
| Other values (9) | 68057 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9751935 |
| Value | Count | Frequency (%) |
| (unknown) | 293484 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| 750230 | 7.7% | |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30221 | |
| a | 30000 | |
| C | 29769 | |
| 22610 | 7.7% | |
| i | 22610 | 7.7% |
| t | 22610 | 7.7% |
| r | 22379 | 7.6% |
| d | 22379 | 7.6% |
| b | 15230 | 5.2% |
| D | 7619 | 2.6% |
| Other values (9) | 68057 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9751935 |
| Value | Count | Frequency (%) |
| (unknown) | 293484 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| 750230 | 7.7% | |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30221 | |
| a | 30000 | |
| C | 29769 | |
| 22610 | 7.7% | |
| i | 22610 | 7.7% |
| t | 22610 | 7.7% |
| r | 22379 | 7.6% |
| d | 22379 | 7.6% |
| b | 15230 | 5.2% |
| D | 7619 | 2.6% |
| Other values (9) | 68057 |
store_location
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 10 | 10 |
| Median length | 10 | 10 |
| Mean length | 10 | 10 |
| Min length | 10 | 10 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Location A | Location C |
| 2nd row | Location C | Location A |
| 3rd row | Location A | Location B |
| 4th row | Location A | Location C |
| 5th row | Location C | Location B |
| Value | Count | Frequency (%) |
| location | 1000000 | |
| c | 250336 | 12.5% |
| b | 250280 | 12.5% |
| a | 250150 | 12.5% |
| d | 249234 | 12.5% |
| Value | Count | Frequency (%) |
| location | 30000 | |
| c | 7569 | 12.6% |
| a | 7562 | 12.6% |
| b | 7501 | 12.5% |
| d | 7368 | 12.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| C | 7569 | 2.5% |
| A | 7562 | 2.5% |
| Other values (2) | 14869 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| C | 7569 | 2.5% |
| A | 7562 | 2.5% |
| Other values (2) | 14869 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| C | 7569 | 2.5% |
| A | 7562 | 2.5% |
| Other values (2) | 14869 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| C | 7569 | 2.5% |
| A | 7562 | 2.5% |
| Other values (2) | 14869 | 5.0% |
transaction_hour
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 24 | 24 |
| Distinct (%) | < 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 11.505193 | 11.50666667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 23 | 23 |
| Zeros | 41756 | 1277 |
| Zeros (%) | 4.2% | 4.3% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 1 | 1 |
| Q1 | 5 | 6 |
| median | 12 | 11 |
| Q3 | 18 | 18 |
| 95-th percentile | 22 | 22 |
| Maximum | 23 | 23 |
| Range | 23 | 23 |
| Interquartile range (IQR) | 13 | 12 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 6.924459761 | 6.939118143 |
| Coefficient of variation (CV) | 0.6018551589 | 0.6030519823 |
| Kurtosis | -1.205305317 | -1.202551467 |
| Mean | 11.505193 | 11.50666667 |
| Median Absolute Deviation (MAD) | 6 | 6 |
| Skewness | -0.001531297707 | -0.002854367692 |
| Sum | 11505193 | 345200 |
| Variance | 47.94814298 | 48.1513606 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 42166 | 4.2% |
| 14 | 42161 | 4.2% |
| 18 | 41872 | 4.2% |
| 20 | 41812 | 4.2% |
| 3 | 41780 | 4.2% |
| 21 | 41778 | 4.2% |
| 4 | 41756 | 4.2% |
| 0 | 41756 | 4.2% |
| 23 | 41750 | 4.2% |
| 19 | 41707 | 4.2% |
| Other values (14) | 581462 |
| Value | Count | Frequency (%) |
| 10 | 1292 | 4.3% |
| 1 | 1287 | 4.3% |
| 20 | 1286 | 4.3% |
| 23 | 1283 | 4.3% |
| 22 | 1280 | 4.3% |
| 2 | 1279 | 4.3% |
| 0 | 1277 | 4.3% |
| 14 | 1274 | 4.2% |
| 17 | 1258 | 4.2% |
| 11 | 1257 | 4.2% |
| Other values (14) | 17227 |
| Value | Count | Frequency (%) |
| 0 | 41756 | |
| 1 | 41637 | |
| 2 | 41388 | |
| 3 | 41780 | |
| 4 | 41756 |
| Value | Count | Frequency (%) |
| 0 | 1277 | |
| 1 | 1287 | |
| 2 | 1279 | |
| 3 | 1202 | |
| 4 | 1196 |
| Value | Count | Frequency (%) |
| 0 | 1277 | |
| 1 | 1287 | |
| 2 | 1279 | |
| 3 | 1202 | |
| 4 | 1196 |
| Value | Count | Frequency (%) |
| 0 | 41756 | |
| 1 | 41637 | |
| 2 | 41388 | |
| 3 | 41780 | |
| 4 | 41756 |
day_of_week
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 7 | 7 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 9 | 9 |
| Median length | 8 | 8 |
| Mean length | 7.141075 | 7.146633333 |
| Min length | 6 | 6 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Wednesday | Friday |
| 2nd row | Friday | Saturday |
| 3rd row | Saturday | Sunday |
| 4th row | Friday | Sunday |
| 5th row | Monday | Friday |
| Value | Count | Frequency (%) |
| tuesday | 143452 | |
| friday | 143067 | |
| thursday | 142930 | |
| sunday | 142875 | |
| monday | 142855 | |
| saturday | 142700 | |
| wednesday | 142121 |
| Value | Count | Frequency (%) |
| tuesday | 4332 | |
| sunday | 4319 | |
| wednesday | 4311 | |
| thursday | 4286 | |
| monday | 4282 | |
| saturday | 4281 | |
| friday | 4189 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| d | 34311 | |
| a | 34281 | |
| y | 30000 | |
| u | 17218 | |
| e | 12954 | 6.0% |
| s | 12929 | 6.0% |
| n | 12912 | 6.0% |
| r | 12756 | 5.9% |
| T | 8618 | 4.0% |
| S | 8600 | 4.0% |
| Other values (7) | 29820 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7141075 |
| Value | Count | Frequency (%) |
| (unknown) | 214399 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| d | 34311 | |
| a | 34281 | |
| y | 30000 | |
| u | 17218 | |
| e | 12954 | 6.0% |
| s | 12929 | 6.0% |
| n | 12912 | 6.0% |
| r | 12756 | 5.9% |
| T | 8618 | 4.0% |
| S | 8600 | 4.0% |
| Other values (7) | 29820 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7141075 |
| Value | Count | Frequency (%) |
| (unknown) | 214399 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| d | 34311 | |
| a | 34281 | |
| y | 30000 | |
| u | 17218 | |
| e | 12954 | 6.0% |
| s | 12929 | 6.0% |
| n | 12912 | 6.0% |
| r | 12756 | 5.9% |
| T | 8618 | 4.0% |
| S | 8600 | 4.0% |
| Other values (7) | 29820 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7141075 |
| Value | Count | Frequency (%) |
| (unknown) | 214399 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| d | 34311 | |
| a | 34281 | |
| y | 30000 | |
| u | 17218 | |
| e | 12954 | 6.0% |
| s | 12929 | 6.0% |
| n | 12912 | 6.0% |
| r | 12756 | 5.9% |
| T | 8618 | 4.0% |
| S | 8600 | 4.0% |
| Other values (7) | 29820 |
week_of_year
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 52 | 52 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 26.503691 | 26.49406667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 52 | 52 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 3 | 3 |
| Q1 | 14 | 13 |
| median | 27 | 26 |
| Q3 | 39 | 40 |
| 95-th percentile | 50 | 50 |
| Maximum | 52 | 52 |
| Range | 51 | 51 |
| Interquartile range (IQR) | 25 | 27 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 15.00516516 | 15.04609953 |
| Coefficient of variation (CV) | 0.566153792 | 0.5679044941 |
| Kurtosis | -1.199199248 | -1.207693619 |
| Mean | 26.503691 | 26.49406667 |
| Median Absolute Deviation (MAD) | 13 | 13 |
| Skewness | -0.0005909978351 | 0.002932240747 |
| Sum | 26503691 | 794822 |
| Variance | 225.1549815 | 226.385111 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 27 | 19588 | 2.0% |
| 19 | 19507 | 2.0% |
| 51 | 19447 | 1.9% |
| 26 | 19425 | 1.9% |
| 1 | 19399 | 1.9% |
| 25 | 19386 | 1.9% |
| 21 | 19371 | 1.9% |
| 44 | 19356 | 1.9% |
| 16 | 19348 | 1.9% |
| 9 | 19340 | 1.9% |
| Other values (42) | 805833 |
| Value | Count | Frequency (%) |
| 10 | 624 | 2.1% |
| 32 | 624 | 2.1% |
| 19 | 620 | 2.1% |
| 43 | 611 | 2.0% |
| 47 | 606 | 2.0% |
| 3 | 606 | 2.0% |
| 21 | 604 | 2.0% |
| 25 | 596 | 2.0% |
| 17 | 595 | 2.0% |
| 9 | 593 | 2.0% |
| Other values (42) | 23921 |
| Value | Count | Frequency (%) |
| 1 | 19399 | |
| 2 | 19179 | |
| 3 | 19150 | |
| 4 | 19137 | |
| 5 | 19328 |
| Value | Count | Frequency (%) |
| 1 | 570 | |
| 2 | 592 | |
| 3 | 606 | |
| 4 | 556 | |
| 5 | 586 |
| Value | Count | Frequency (%) |
| 1 | 570 | |
| 2 | 592 | |
| 3 | 606 | |
| 4 | 556 | |
| 5 | 586 |
| Value | Count | Frequency (%) |
| 1 | 19399 | |
| 2 | 19179 | |
| 3 | 19150 | |
| 4 | 19137 | |
| 5 | 19328 |
month_of_year
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 12 | 12 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 6.497467 | 6.4968 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 12 | 12 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1 | 1 |
| Q1 | 3 | 3 |
| median | 7 | 6 |
| Q3 | 10 | 10 |
| 95-th percentile | 12 | 12 |
| Maximum | 12 | 12 |
| Range | 11 | 11 |
| Interquartile range (IQR) | 7 | 7 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 3.455211936 | 3.457695034 |
| Coefficient of variation (CV) | 0.531778297 | 0.5322150957 |
| Kurtosis | -1.21903855 | -1.218478263 |
| Mean | 6.497467 | 6.4968 |
| Median Absolute Deviation (MAD) | 3 | 3 |
| Skewness | 0.0003820436436 | 0.002125819773 |
| Sum | 6497467 | 194904 |
| Variance | 11.93848952 | 11.95565495 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 83951 | |
| 11 | 83645 | |
| 1 | 83624 | |
| 7 | 83475 | |
| 12 | 83353 | |
| 9 | 83328 | |
| 3 | 83244 | |
| 5 | 83135 | |
| 10 | 83113 | |
| 8 | 83093 | |
| Other values (2) | 166039 |
| Value | Count | Frequency (%) |
| 10 | 2541 | |
| 2 | 2538 | |
| 7 | 2537 | |
| 6 | 2526 | |
| 12 | 2521 | |
| 1 | 2515 | |
| 5 | 2514 | |
| 11 | 2489 | |
| 4 | 2473 | |
| 3 | 2466 | |
| Other values (2) | 4880 |
| Value | Count | Frequency (%) |
| 1 | 83624 | |
| 2 | 83951 | |
| 3 | 83244 | |
| 4 | 83091 | |
| 5 | 83135 |
| Value | Count | Frequency (%) |
| 1 | 2515 | |
| 2 | 2538 | |
| 3 | 2466 | |
| 4 | 2473 | |
| 5 | 2514 |
| Value | Count | Frequency (%) |
| 1 | 2515 | |
| 2 | 2538 | |
| 3 | 2466 | |
| 4 | 2473 | |
| 5 | 2514 |
| Value | Count | Frequency (%) |
| 1 | 83624 | |
| 2 | 83951 | |
| 3 | 83244 | |
| 4 | 83091 | |
| 5 | 83135 |
avg_purchase_value
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 49001 | 22451 |
| Distinct (%) | 4.9% | 74.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 254.8864443 | 255.0930827 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10.01 |
| Maximum | 500 | 499.97 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10.01 |
| 5-th percentile | 34.4 | 33.79 |
| Q1 | 132.22 | 131.975 |
| median | 254.93 | 256.13 |
| Q3 | 377.35 | 378.32 |
| 95-th percentile | 475.56 | 476.0405 |
| Maximum | 500 | 499.97 |
| Range | 490 | 489.96 |
| Interquartile range (IQR) | 245.13 | 246.345 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 141.4949233 | 142.102325 |
| Coefficient of variation (CV) | 0.5551292604 | 0.5570606757 |
| Kurtosis | -1.200170422 | -1.205574492 |
| Mean | 254.8864443 | 255.0930827 |
| Median Absolute Deviation (MAD) | 122.57 | 123.115 |
| Skewness | 0.0003762833586 | -0.001960015767 |
| Sum | 254886444.3 | 7652792.48 |
| Variance | 20020.81333 | 20193.07077 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 76.54 | 41 | < 0.1% |
| 482.75 | 41 | < 0.1% |
| 372.04 | 40 | < 0.1% |
| 397.45 | 39 | < 0.1% |
| 246.87 | 39 | < 0.1% |
| 60.53 | 38 | < 0.1% |
| 278.34 | 38 | < 0.1% |
| 315.26 | 38 | < 0.1% |
| 165.81 | 38 | < 0.1% |
| 492.47 | 38 | < 0.1% |
| Other values (48991) | 999610 |
| Value | Count | Frequency (%) |
| 78.3 | 5 | < 0.1% |
| 194.92 | 5 | < 0.1% |
| 180.8 | 5 | < 0.1% |
| 228.66 | 5 | < 0.1% |
| 282.32 | 5 | < 0.1% |
| 158.4 | 5 | < 0.1% |
| 492.22 | 5 | < 0.1% |
| 373.44 | 5 | < 0.1% |
| 102.51 | 5 | < 0.1% |
| 78.13 | 5 | < 0.1% |
| Other values (22441) | 29950 |
| Value | Count | Frequency (%) |
| 10 | 8 | < 0.1% |
| 10.01 | 23 | |
| 10.02 | 29 | |
| 10.03 | 17 | |
| 10.04 | 21 |
| Value | Count | Frequency (%) |
| 10.01 | 2 | |
| 10.02 | 1 | < 0.1% |
| 10.03 | 1 | < 0.1% |
| 10.04 | 1 | < 0.1% |
| 10.05 | 3 |
| Value | Count | Frequency (%) |
| 10.01 | 2 | |
| 10.02 | 1 | < 0.1% |
| 10.03 | 1 | < 0.1% |
| 10.04 | 1 | < 0.1% |
| 10.05 | 3 |
| Value | Count | Frequency (%) |
| 10 | 8 | < 0.1% |
| 10.01 | 23 | |
| 10.02 | 29 | |
| 10.03 | 17 | |
| 10.04 | 21 |
purchase_frequency
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 6 | 6 |
| Mean length | 6.000399 | 5.992 |
| Min length | 5 | 5 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Weekly | Yearly |
| 2nd row | Daily | Weekly |
| 3rd row | Weekly | Weekly |
| 4th row | Weekly | Weekly |
| 5th row | Yearly | Monthly |
| Value | Count | Frequency (%) |
| yearly | 250767 | |
| monthly | 249932 | |
| weekly | 249768 | |
| daily | 249533 |
| Value | Count | Frequency (%) |
| weekly | 7579 | |
| daily | 7574 | |
| yearly | 7513 | |
| monthly | 7334 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22671 | |
| a | 15087 | |
| W | 7579 | 4.2% |
| k | 7579 | 4.2% |
| D | 7574 | 4.2% |
| i | 7574 | 4.2% |
| Y | 7513 | 4.2% |
| r | 7513 | 4.2% |
| Other values (5) | 36670 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000399 |
| Value | Count | Frequency (%) |
| (unknown) | 179760 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22671 | |
| a | 15087 | |
| W | 7579 | 4.2% |
| k | 7579 | 4.2% |
| D | 7574 | 4.2% |
| i | 7574 | 4.2% |
| Y | 7513 | 4.2% |
| r | 7513 | 4.2% |
| Other values (5) | 36670 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6000399 |
| Value | Count | Frequency (%) |
| (unknown) | 179760 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22671 | |
| a | 15087 | |
| W | 7579 | 4.2% |
| k | 7579 | 4.2% |
| D | 7574 | 4.2% |
| i | 7574 | 4.2% |
| Y | 7513 | 4.2% |
| r | 7513 | 4.2% |
| Other values (5) | 36670 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6000399 |
| Value | Count | Frequency (%) |
| (unknown) | 179760 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22671 | |
| a | 15087 | |
| W | 7579 | 4.2% |
| k | 7579 | 4.2% |
| D | 7574 | 4.2% |
| i | 7574 | 4.2% |
| Y | 7513 | 4.2% |
| r | 7513 | 4.2% |
| Other values (5) | 36670 |
last_purchase_date
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 984242 | 29988 |
| Distinct (%) | 98.4% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 968656 | 29976 ? |
| Unique (%) | 96.9% | 99.9% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | 2021-09-11 04:22:38 | 2021-01-20 18:14:28 |
| 2nd row | 2021-05-16 12:01:16 | 2021-07-08 12:01:54 |
| 3rd row | 2021-02-07 16:47:48 | 2021-04-26 19:20:09 |
| 4th row | 2021-12-30 23:48:26 | 2021-02-19 10:14:53 |
| 5th row | 2021-11-02 11:48:25 | 2021-01-14 04:24:54 |
| Value | Count | Frequency (%) |
| 2021-01-02 | 2870 | 0.1% |
| 2021-05-14 | 2866 | 0.1% |
| 2021-12-25 | 2860 | 0.1% |
| 2021-01-17 | 2860 | 0.1% |
| 2021-10-17 | 2856 | 0.1% |
| 2021-01-26 | 2856 | 0.1% |
| 2021-08-16 | 2854 | 0.1% |
| 2021-05-05 | 2852 | 0.1% |
| 2021-10-16 | 2850 | 0.1% |
| 2021-09-17 | 2849 | 0.1% |
| Other values (86754) | 1971427 |
| Value | Count | Frequency (%) |
| 2021-01-28 | 110 | 0.2% |
| 2021-04-27 | 110 | 0.2% |
| 2021-08-04 | 110 | 0.2% |
| 2021-04-29 | 107 | 0.2% |
| 2021-09-17 | 104 | 0.2% |
| 2021-06-30 | 102 | 0.2% |
| 2021-04-20 | 102 | 0.2% |
| 2021-02-02 | 102 | 0.2% |
| 2021-08-17 | 101 | 0.2% |
| 2021-08-23 | 101 | 0.2% |
| Other values (25697) | 58951 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102631 | |
| 0 | 98938 | |
| 1 | 87968 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26805 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23890 | 4.2% |
| 8 | 14026 | 2.5% |
| Other values (3) | 41728 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102631 | |
| 0 | 98938 | |
| 1 | 87968 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26805 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23890 | 4.2% |
| 8 | 14026 | 2.5% |
| Other values (3) | 41728 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102631 | |
| 0 | 98938 | |
| 1 | 87968 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26805 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23890 | 4.2% |
| 8 | 14026 | 2.5% |
| Other values (3) | 41728 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102631 | |
| 0 | 98938 | |
| 1 | 87968 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 26805 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23890 | 4.2% |
| 8 | 14026 | 2.5% |
| Other values (3) | 41728 |
avg_discount_used
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 51 | 51 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.25001009 | 0.249945 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 0.5 | 0.5 |
| Zeros | 10010 | 294 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.03 | 0.02 |
| Q1 | 0.13 | 0.13 |
| median | 0.25 | 0.25 |
| Q3 | 0.38 | 0.37 |
| 95-th percentile | 0.47 | 0.47 |
| Maximum | 0.5 | 0.5 |
| Range | 0.5 | 0.5 |
| Interquartile range (IQR) | 0.25 | 0.24 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 0.1443825628 | 0.1436977683 |
| Coefficient of variation (CV) | 0.5775069431 | 0.574917555 |
| Kurtosis | -1.19810725 | -1.189045062 |
| Mean | 0.25001009 | 0.249945 |
| Median Absolute Deviation (MAD) | 0.12 | 0.12 |
| Skewness | 0.0002818589406 | 0.002082182261 |
| Sum | 250010.09 | 7498.35 |
| Variance | 0.02084632444 | 0.02064904861 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0.39 | 20194 | 2.0% |
| 0.15 | 20188 | 2.0% |
| 0.08 | 20140 | 2.0% |
| 0.21 | 20138 | 2.0% |
| 0.34 | 20131 | 2.0% |
| 0.47 | 20125 | 2.0% |
| 0.05 | 20124 | 2.0% |
| 0.16 | 20123 | 2.0% |
| 0.46 | 20109 | 2.0% |
| 0.32 | 20093 | 2.0% |
| Other values (41) | 798635 |
| Value | Count | Frequency (%) |
| 0.11 | 683 | 2.3% |
| 0.21 | 668 | 2.2% |
| 0.17 | 652 | 2.2% |
| 0.34 | 644 | 2.1% |
| 0.33 | 643 | 2.1% |
| 0.31 | 638 | 2.1% |
| 0.28 | 623 | 2.1% |
| 0.41 | 623 | 2.1% |
| 0.4 | 622 | 2.1% |
| 0.43 | 619 | 2.1% |
| Other values (41) | 23585 |
| Value | Count | Frequency (%) |
| 0 | 10010 | |
| 0.01 | 19893 | |
| 0.02 | 19951 | |
| 0.03 | 19949 | |
| 0.04 | 20004 |
| Value | Count | Frequency (%) |
| 0 | 294 | |
| 0.01 | 603 | |
| 0.02 | 607 | |
| 0.03 | 564 | |
| 0.04 | 570 |
| Value | Count | Frequency (%) |
| 0 | 294 | |
| 0.01 | 603 | |
| 0.02 | 607 | |
| 0.03 | 564 | |
| 0.04 | 570 |
| Value | Count | Frequency (%) |
| 0 | 10010 | |
| 0.01 | 19893 | |
| 0.02 | 19951 | |
| 0.03 | 19949 | |
| 0.04 | 20004 |
preferred_store
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 10 | 10 |
| Median length | 10 | 10 |
| Mean length | 10 | 10 |
| Min length | 10 | 10 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Location A | Location A |
| 2nd row | Location C | Location B |
| 3rd row | Location B | Location A |
| 4th row | Location B | Location B |
| 5th row | Location B | Location C |
| Value | Count | Frequency (%) |
| location | 1000000 | |
| b | 250262 | 12.5% |
| d | 250007 | 12.5% |
| a | 249949 | 12.5% |
| c | 249782 | 12.5% |
| Value | Count | Frequency (%) |
| location | 30000 | |
| a | 7555 | 12.6% |
| b | 7550 | 12.6% |
| d | 7484 | 12.5% |
| c | 7411 | 12.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| A | 7555 | 2.5% |
| B | 7550 | 2.5% |
| Other values (2) | 14895 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| A | 7555 | 2.5% |
| B | 7550 | 2.5% |
| Other values (2) | 14895 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| A | 7555 | 2.5% |
| B | 7550 | 2.5% |
| Other values (2) | 14895 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| 1000000 | ||
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| 30000 | ||
| A | 7555 | 2.5% |
| B | 7550 | 2.5% |
| Other values (2) | 14895 | 5.0% |
online_purchases
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.446018 | 49.47733333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 9997 | 299 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 4 | 4 |
| Q1 | 24 | 25 |
| median | 49 | 49 |
| Q3 | 74 | 74 |
| 95-th percentile | 94 | 94 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 50 | 49 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 28.86143913 | 28.79320732 |
| Coefficient of variation (CV) | 0.5836959234 | 0.581947437 |
| Kurtosis | -1.200754846 | -1.193949208 |
| Mean | 49.446018 | 49.47733333 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 0.00143421854 | -0.00151205085 |
| Sum | 49446018 | 1484320 |
| Variance | 832.9826689 | 829.0487878 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 10324 | 1.0% |
| 28 | 10269 | 1.0% |
| 40 | 10198 | 1.0% |
| 67 | 10151 | 1.0% |
| 76 | 10150 | 1.0% |
| 61 | 10150 | 1.0% |
| 52 | 10140 | 1.0% |
| 88 | 10134 | 1.0% |
| 43 | 10133 | 1.0% |
| 45 | 10132 | 1.0% |
| Other values (90) | 898219 |
| Value | Count | Frequency (%) |
| 33 | 337 | 1.1% |
| 49 | 333 | 1.1% |
| 51 | 331 | 1.1% |
| 60 | 329 | 1.1% |
| 6 | 329 | 1.1% |
| 42 | 326 | 1.1% |
| 28 | 326 | 1.1% |
| 96 | 324 | 1.1% |
| 38 | 324 | 1.1% |
| 76 | 324 | 1.1% |
| Other values (90) | 26717 |
| Value | Count | Frequency (%) |
| 0 | 9997 | |
| 1 | 10023 | |
| 2 | 9792 | |
| 3 | 10091 | |
| 4 | 10324 |
| Value | Count | Frequency (%) |
| 0 | 299 | |
| 1 | 301 | |
| 2 | 293 | |
| 3 | 322 | |
| 4 | 313 |
| Value | Count | Frequency (%) |
| 0 | 299 | |
| 1 | 301 | |
| 2 | 293 | |
| 3 | 322 | |
| 4 | 313 |
| Value | Count | Frequency (%) |
| 0 | 9997 | |
| 1 | 10023 | |
| 2 | 9792 | |
| 3 | 10091 | |
| 4 | 10324 |
in_store_purchases
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.484486 | 49.26486667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 10016 | 321 |
| Zeros (%) | 1.0% | 1.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 5 | 5 |
| Q1 | 24 | 24 |
| median | 49 | 49 |
| Q3 | 75 | 74 |
| 95-th percentile | 95 | 94 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 51 | 50 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 28.88271174 | 28.80988452 |
| Coefficient of variation (CV) | 0.5836720572 | 0.5847957473 |
| Kurtosis | -1.20140369 | -1.198586796 |
| Mean | 49.484486 | 49.26486667 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 0.00159043567 | 0.01270653627 |
| Sum | 49484486 | 1477946 |
| Variance | 834.2110375 | 830.009446 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 38 | 10264 | 1.0% |
| 30 | 10186 | 1.0% |
| 86 | 10183 | 1.0% |
| 10 | 10180 | 1.0% |
| 14 | 10171 | 1.0% |
| 7 | 10166 | 1.0% |
| 13 | 10164 | 1.0% |
| 50 | 10151 | 1.0% |
| 67 | 10141 | 1.0% |
| 91 | 10131 | 1.0% |
| Other values (90) | 898263 |
| Value | Count | Frequency (%) |
| 71 | 336 | 1.1% |
| 26 | 334 | 1.1% |
| 31 | 334 | 1.1% |
| 63 | 332 | 1.1% |
| 32 | 329 | 1.1% |
| 89 | 327 | 1.1% |
| 67 | 325 | 1.1% |
| 16 | 323 | 1.1% |
| 79 | 323 | 1.1% |
| 28 | 322 | 1.1% |
| Other values (90) | 26715 |
| Value | Count | Frequency (%) |
| 0 | 10016 | |
| 1 | 9978 | |
| 2 | 9953 | |
| 3 | 9965 | |
| 4 | 9926 |
| Value | Count | Frequency (%) |
| 0 | 321 | |
| 1 | 284 | |
| 2 | 283 | |
| 3 | 316 | |
| 4 | 287 |
| Value | Count | Frequency (%) |
| 0 | 321 | |
| 1 | 284 | |
| 2 | 283 | |
| 3 | 316 | |
| 4 | 287 |
| Value | Count | Frequency (%) |
| 0 | 10016 | |
| 1 | 9978 | |
| 2 | 9953 | |
| 3 | 9965 | |
| 4 | 9926 |
avg_items_per_transaction
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 901 | 901 |
| Distinct (%) | 0.1% | 3.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.50312187 | 5.484473333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 10 | 10 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1.45 | 1.43 |
| Q1 | 3.26 | 3.23 |
| median | 5.5 | 5.49 |
| Q3 | 7.75 | 7.73 |
| 95-th percentile | 9.55 | 9.56 |
| Maximum | 10 | 10 |
| Range | 9 | 9 |
| Interquartile range (IQR) | 4.49 | 4.5 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2.597661275 | 2.606511798 |
| Coefficient of variation (CV) | 0.4720341173 | 0.4752528893 |
| Kurtosis | -1.199082145 | -1.198518161 |
| Mean | 5.50312187 | 5.484473333 |
| Median Absolute Deviation (MAD) | 2.25 | 2.25 |
| Skewness | -4.461054903 × 10-5 | 0.003201275435 |
| Sum | 5503121.87 | 164534.2 |
| Variance | 6.747844097 | 6.793903753 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 3.49 | 1205 | 0.1% |
| 5 | 1198 | 0.1% |
| 3.94 | 1197 | 0.1% |
| 6.41 | 1196 | 0.1% |
| 2.82 | 1193 | 0.1% |
| 8.41 | 1192 | 0.1% |
| 9.69 | 1192 | 0.1% |
| 4.29 | 1190 | 0.1% |
| 4.35 | 1188 | 0.1% |
| 6.14 | 1188 | 0.1% |
| Other values (891) | 988061 |
| Value | Count | Frequency (%) |
| 5.89 | 59 | 0.2% |
| 5.11 | 50 | 0.2% |
| 9.59 | 48 | 0.2% |
| 6.8 | 48 | 0.2% |
| 2.22 | 48 | 0.2% |
| 5.34 | 47 | 0.2% |
| 1.49 | 46 | 0.2% |
| 5.21 | 46 | 0.2% |
| 8.13 | 46 | 0.2% |
| 5.58 | 46 | 0.2% |
| Other values (891) | 29516 |
| Value | Count | Frequency (%) |
| 1 | 514 | |
| 1.01 | 1135 | |
| 1.02 | 1105 | |
| 1.03 | 1122 | |
| 1.04 | 1067 |
| Value | Count | Frequency (%) |
| 1 | 17 | |
| 1.01 | 26 | |
| 1.02 | 42 | |
| 1.03 | 36 | |
| 1.04 | 30 |
| Value | Count | Frequency (%) |
| 1 | 17 | |
| 1.01 | 26 | |
| 1.02 | 42 | |
| 1.03 | 36 | |
| 1.04 | 30 |
| Value | Count | Frequency (%) |
| 1 | 514 | |
| 1.01 | 1135 | |
| 1.02 | 1105 | |
| 1.03 | 1122 | |
| 1.04 | 1067 |
avg_transaction_value
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 49001 | 22408 |
| Distinct (%) | 4.9% | 74.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 255.1157678 | 254.3950917 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10 |
| Maximum | 500 | 499.99 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10 |
| 5-th percentile | 34.52 | 34.1295 |
| Q1 | 132.51 | 132.5075 |
| median | 255.23 | 254.55 |
| Q3 | 377.67 | 375.81 |
| 95-th percentile | 475.36 | 475.502 |
| Maximum | 500 | 499.99 |
| Range | 490 | 489.99 |
| Interquartile range (IQR) | 245.16 | 243.3025 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 141.4300141 | 141.2426008 |
| Coefficient of variation (CV) | 0.5543758243 | 0.5552096146 |
| Kurtosis | -1.200885422 | -1.191863439 |
| Mean | 255.1157678 | 254.3950917 |
| Median Absolute Deviation (MAD) | 122.58 | 121.64 |
| Skewness | -0.001148163222 | 0.004421617675 |
| Sum | 255115767.8 | 7631852.75 |
| Variance | 20002.44888 | 19949.47228 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 362.11 | 43 | < 0.1% |
| 157.68 | 43 | < 0.1% |
| 86.72 | 42 | < 0.1% |
| 303.99 | 41 | < 0.1% |
| 193.66 | 41 | < 0.1% |
| 112.64 | 40 | < 0.1% |
| 342.26 | 40 | < 0.1% |
| 454.72 | 39 | < 0.1% |
| 64.18 | 39 | < 0.1% |
| 280.55 | 39 | < 0.1% |
| Other values (48991) | 999593 |
| Value | Count | Frequency (%) |
| 262.56 | 6 | < 0.1% |
| 275.93 | 5 | < 0.1% |
| 359.56 | 5 | < 0.1% |
| 149.46 | 5 | < 0.1% |
| 153.96 | 5 | < 0.1% |
| 245.13 | 5 | < 0.1% |
| 194.86 | 5 | < 0.1% |
| 416.22 | 5 | < 0.1% |
| 338.17 | 5 | < 0.1% |
| 413.57 | 5 | < 0.1% |
| Other values (22398) | 29949 |
| Value | Count | Frequency (%) |
| 10 | 8 | < 0.1% |
| 10.01 | 18 | |
| 10.02 | 17 | |
| 10.03 | 28 | |
| 10.04 | 24 |
| Value | Count | Frequency (%) |
| 10 | 1 | |
| 10.03 | 1 | |
| 10.04 | 1 | |
| 10.05 | 1 | |
| 10.06 | 2 |
| Value | Count | Frequency (%) |
| 10 | 1 | |
| 10.03 | 1 | |
| 10.04 | 1 | |
| 10.05 | 1 | |
| 10.06 | 2 |
| Value | Count | Frequency (%) |
| 10 | 8 | < 0.1% |
| 10.01 | 18 | |
| 10.02 | 17 | |
| 10.03 | 28 | |
| 10.04 | 24 |
total_returned_items
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 10 | 10 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 4.498142 | 4.474033333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 9 | 9 |
| Zeros | 100060 | 3026 |
| Zeros (%) | 10.0% | 10.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0 | 0 |
| Q1 | 2 | 2 |
| median | 4 | 4 |
| Q3 | 7 | 7 |
| 95-th percentile | 9 | 9 |
| Maximum | 9 | 9 |
| Range | 9 | 9 |
| Interquartile range (IQR) | 5 | 5 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2.872805041 | 2.859463519 |
| Coefficient of variation (CV) | 0.6386648177 | 0.6391243216 |
| Kurtosis | -1.225109848 | -1.207499289 |
| Mean | 4.498142 | 4.474033333 |
| Median Absolute Deviation (MAD) | 3 | 2 |
| Skewness | 0.0007692254728 | 0.01226172092 |
| Sum | 4498142 | 134221 |
| Variance | 8.253008801 | 8.176531617 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 100298 | |
| 7 | 100190 | |
| 3 | 100119 | |
| 0 | 100060 | |
| 6 | 100004 | |
| 2 | 99991 | |
| 9 | 99942 | |
| 8 | 99838 | |
| 4 | 99821 | |
| 5 | 99737 |
| Value | Count | Frequency (%) |
| 3 | 3097 | |
| 5 | 3073 | |
| 4 | 3035 | |
| 0 | 3026 | |
| 6 | 3024 | |
| 2 | 2989 | |
| 1 | 2967 | |
| 9 | 2948 | |
| 7 | 2924 | |
| 8 | 2917 |
| Value | Count | Frequency (%) |
| 0 | 100060 | |
| 1 | 100298 | |
| 2 | 99991 | |
| 3 | 100119 | |
| 4 | 99821 |
| Value | Count | Frequency (%) |
| 0 | 3026 | |
| 1 | 2967 | |
| 2 | 2989 | |
| 3 | 3097 | |
| 4 | 3035 |
| Value | Count | Frequency (%) |
| 0 | 3026 | |
| 1 | 2967 | |
| 2 | 2989 | |
| 3 | 3097 | |
| 4 | 3035 |
| Value | Count | Frequency (%) |
| 0 | 100060 | |
| 1 | 100298 | |
| 2 | 99991 | |
| 3 | 100119 | |
| 4 | 99821 |
total_returned_value
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 99999 | 25888 |
| Distinct (%) | 10.0% | 86.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 500.3878374 | 501.0130153 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0.03 |
| Maximum | 1000 | 1000 |
| Zeros | 4 | 0 |
| Zeros (%) | < 0.1% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0.03 |
| 5-th percentile | 50 | 50.778 |
| Q1 | 250.63 | 253.4025 |
| median | 500.4 | 498.185 |
| Q3 | 750.39 | 751.7325 |
| 95-th percentile | 950.22 | 949.7815 |
| Maximum | 1000 | 1000 |
| Range | 1000 | 999.97 |
| Interquartile range (IQR) | 499.76 | 498.33 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 288.7174763 | 288.2634295 |
| Coefficient of variation (CV) | 0.5769873981 | 0.5753611597 |
| Kurtosis | -1.199754459 | -1.200630145 |
| Mean | 500.3878374 | 501.0130153 |
| Median Absolute Deviation (MAD) | 249.89 | 249.22 |
| Skewness | -0.001264828821 | 0.003471319382 |
| Sum | 500387837.4 | 15030390.46 |
| Variance | 83357.78115 | 83095.8048 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 160.66 | 28 | < 0.1% |
| 467.66 | 26 | < 0.1% |
| 188.3 | 26 | < 0.1% |
| 488.88 | 25 | < 0.1% |
| 544.94 | 25 | < 0.1% |
| 651.87 | 25 | < 0.1% |
| 981.42 | 25 | < 0.1% |
| 330.91 | 25 | < 0.1% |
| 676.05 | 25 | < 0.1% |
| 227.59 | 25 | < 0.1% |
| Other values (99989) | 999745 |
| Value | Count | Frequency (%) |
| 260.36 | 6 | < 0.1% |
| 15.38 | 5 | < 0.1% |
| 108.29 | 5 | < 0.1% |
| 392.6 | 4 | < 0.1% |
| 282.57 | 4 | < 0.1% |
| 851.39 | 4 | < 0.1% |
| 815.1 | 4 | < 0.1% |
| 122.83 | 4 | < 0.1% |
| 251.53 | 4 | < 0.1% |
| 730.69 | 4 | < 0.1% |
| Other values (25878) | 29956 |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 0.01 | 13 | |
| 0.02 | 12 | |
| 0.03 | 11 | |
| 0.04 | 7 |
| Value | Count | Frequency (%) |
| 0.03 | 1 | |
| 0.05 | 1 | |
| 0.06 | 1 | |
| 0.11 | 1 | |
| 0.17 | 1 |
| Value | Count | Frequency (%) |
| 0.03 | 1 | |
| 0.05 | 1 | |
| 0.06 | 1 | |
| 0.11 | 1 | |
| 0.17 | 1 |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 0.01 | 13 | |
| 0.02 | 12 | |
| 0.03 | 11 | |
| 0.04 | 7 |
total_sales
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 629254 | 29561 |
| Distinct (%) | 62.9% | 98.5% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5056.059765 | 5039.871729 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 100.01 | 100.06 |
| Maximum | 9999.98 | 9999.78 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 100.01 | 100.06 |
| 5-th percentile | 595.7095 | 585.243 |
| Q1 | 2577.8675 | 2544.635 |
| median | 5059.695 | 5029.3 |
| Q3 | 7534.8025 | 7536.7425 |
| 95-th percentile | 9507.96 | 9496.1245 |
| Maximum | 9999.98 | 9999.78 |
| Range | 9899.97 | 9899.72 |
| Interquartile range (IQR) | 4956.935 | 4992.1075 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2859.100058 | 2862.248413 |
| Coefficient of variation (CV) | 0.5654798777 | 0.5679208851 |
| Kurtosis | -1.201132214 | -1.207616505 |
| Mean | 5056.059765 | 5039.871729 |
| Median Absolute Deviation (MAD) | 2478.365 | 2495.845 |
| Skewness | -0.002792355347 | 0.001697316071 |
| Sum | 5056059765 | 151196151.9 |
| Variance | 8174453.14 | 8192465.978 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 9263.29 | 8 | < 0.1% |
| 1070.51 | 8 | < 0.1% |
| 8973.11 | 8 | < 0.1% |
| 7882.97 | 8 | < 0.1% |
| 630.03 | 8 | < 0.1% |
| 8669.59 | 8 | < 0.1% |
| 8191.02 | 8 | < 0.1% |
| 2558.91 | 8 | < 0.1% |
| 5572.95 | 8 | < 0.1% |
| 8266.95 | 8 | < 0.1% |
| Other values (629244) | 999920 |
| Value | Count | Frequency (%) |
| 9287.32 | 3 | < 0.1% |
| 5343.21 | 3 | < 0.1% |
| 8517.35 | 2 | < 0.1% |
| 6020.86 | 2 | < 0.1% |
| 5550.99 | 2 | < 0.1% |
| 3278.77 | 2 | < 0.1% |
| 3372.59 | 2 | < 0.1% |
| 7783.93 | 2 | < 0.1% |
| 569.43 | 2 | < 0.1% |
| 7306.93 | 2 | < 0.1% |
| Other values (29551) | 29978 |
| Value | Count | Frequency (%) |
| 100.01 | 2 | |
| 100.02 | 2 | |
| 100.04 | 2 | |
| 100.05 | 2 | |
| 100.06 | 3 |
| Value | Count | Frequency (%) |
| 100.06 | 1 | |
| 100.14 | 1 | |
| 100.38 | 1 | |
| 100.48 | 1 | |
| 101.81 | 1 |
| Value | Count | Frequency (%) |
| 100.06 | 1 | |
| 100.14 | 1 | |
| 100.38 | 1 | |
| 100.48 | 1 | |
| 101.81 | 1 |
| Value | Count | Frequency (%) |
| 100.01 | 2 | |
| 100.02 | 2 | |
| 100.04 | 2 | |
| 100.05 | 2 | |
| 100.06 | 3 |
total_transactions
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 99 | 99 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.987386 | 49.79873333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 99 | 99 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 5 | 5 |
| Q1 | 25 | 25 |
| median | 50 | 49 |
| Q3 | 75 | 75 |
| 95-th percentile | 95 | 95 |
| Maximum | 99 | 99 |
| Range | 98 | 98 |
| Interquartile range (IQR) | 50 | 50 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 28.57168895 | 28.65419796 |
| Coefficient of variation (CV) | 0.5715779766 | 0.5754001365 |
| Kurtosis | -1.200697232 | -1.206897219 |
| Mean | 49.987386 | 49.79873333 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 6.496950968 × 10-5 | 0.01136407827 |
| Sum | 49987386 | 1493962 |
| Variance | 816.3414092 | 821.0630605 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 93 | 10385 | 1.0% |
| 24 | 10328 | 1.0% |
| 61 | 10316 | 1.0% |
| 70 | 10306 | 1.0% |
| 49 | 10290 | 1.0% |
| 14 | 10280 | 1.0% |
| 83 | 10278 | 1.0% |
| 27 | 10251 | 1.0% |
| 16 | 10247 | 1.0% |
| 75 | 10245 | 1.0% |
| Other values (89) | 897074 |
| Value | Count | Frequency (%) |
| 34 | 342 | 1.1% |
| 41 | 338 | 1.1% |
| 62 | 329 | 1.1% |
| 16 | 329 | 1.1% |
| 36 | 329 | 1.1% |
| 13 | 328 | 1.1% |
| 11 | 327 | 1.1% |
| 70 | 327 | 1.1% |
| 83 | 325 | 1.1% |
| 47 | 323 | 1.1% |
| Other values (89) | 26703 |
| Value | Count | Frequency (%) |
| 1 | 10053 | |
| 2 | 10174 | |
| 3 | 10113 | |
| 4 | 10133 | |
| 5 | 10140 |
| Value | Count | Frequency (%) |
| 1 | 311 | |
| 2 | 302 | |
| 3 | 310 | |
| 4 | 289 | |
| 5 | 306 |
| Value | Count | Frequency (%) |
| 1 | 311 | |
| 2 | 302 | |
| 3 | 310 | |
| 4 | 289 | |
| 5 | 306 |
| Value | Count | Frequency (%) |
| 1 | 10053 | |
| 2 | 10174 | |
| 3 | 10113 | |
| 4 | 10133 | |
| 5 | 10140 |
total_items_purchased
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 499 | 499 |
| Distinct (%) | < 0.1% | 1.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 250.042763 | 251.2563333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 499 | 499 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 26 | 26 |
| Q1 | 125 | 126 |
| median | 250 | 252 |
| Q3 | 375 | 377 |
| 95-th percentile | 475 | 475 |
| Maximum | 499 | 499 |
| Range | 498 | 498 |
| Interquartile range (IQR) | 250 | 251 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 143.9845462 | 144.3084035 |
| Coefficient of variation (CV) | 0.5758396862 | 0.5743473273 |
| Kurtosis | -1.199364571 | -1.201958267 |
| Mean | 250.042763 | 251.2563333 |
| Median Absolute Deviation (MAD) | 125 | 125 |
| Skewness | -0.0005289537985 | -0.008483461714 |
| Sum | 250042763 | 7537690 |
| Variance | 20731.54954 | 20824.91532 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 282 | 2156 | 0.2% |
| 285 | 2146 | 0.2% |
| 355 | 2132 | 0.2% |
| 459 | 2099 | 0.2% |
| 296 | 2098 | 0.2% |
| 241 | 2096 | 0.2% |
| 413 | 2090 | 0.2% |
| 331 | 2088 | 0.2% |
| 425 | 2087 | 0.2% |
| 260 | 2086 | 0.2% |
| Other values (489) | 978922 |
| Value | Count | Frequency (%) |
| 199 | 84 | 0.3% |
| 340 | 82 | 0.3% |
| 263 | 80 | 0.3% |
| 352 | 77 | 0.3% |
| 457 | 76 | 0.3% |
| 471 | 76 | 0.3% |
| 225 | 76 | 0.3% |
| 121 | 76 | 0.3% |
| 29 | 76 | 0.3% |
| 283 | 75 | 0.2% |
| Other values (489) | 29222 |
| Value | Count | Frequency (%) |
| 1 | 2005 | |
| 2 | 2077 | |
| 3 | 1999 | |
| 4 | 2019 | |
| 5 | 1988 |
| Value | Count | Frequency (%) |
| 1 | 63 | |
| 2 | 69 | |
| 3 | 51 | |
| 4 | 65 | |
| 5 | 46 |
| Value | Count | Frequency (%) |
| 1 | 63 | |
| 2 | 69 | |
| 3 | 51 | |
| 4 | 65 | |
| 5 | 46 |
| Value | Count | Frequency (%) |
| 1 | 2005 | |
| 2 | 2077 | |
| 3 | 1999 | |
| 4 | 2019 | |
| 5 | 1988 |
total_discounts_received
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 99995 | 25951 |
| Distinct (%) | 10.0% | 86.5% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499.6743882 | 499.1838027 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0.02 |
| Maximum | 1000 | 999.88 |
| Zeros | 6 | 0 |
| Zeros (%) | < 0.1% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0.02 |
| 5-th percentile | 50.16 | 49.6795 |
| Q1 | 249.76 | 248.7875 |
| median | 499.51 | 498.065 |
| Q3 | 749.54 | 751.8575 |
| 95-th percentile | 949.66 | 951.02 |
| Maximum | 1000 | 999.88 |
| Range | 1000 | 999.86 |
| Interquartile range (IQR) | 499.78 | 503.07 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 288.5791016 | 289.4257723 |
| Coefficient of variation (CV) | 0.5775343071 | 0.5797980039 |
| Kurtosis | -1.200167414 | -1.207019765 |
| Mean | 499.6743882 | 499.1838027 |
| Median Absolute Deviation (MAD) | 249.9 | 251.37 |
| Skewness | 0.0009745010535 | 0.001267690242 |
| Sum | 499674388.2 | 14975514.08 |
| Variance | 83277.89786 | 83767.2777 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 52.87 | 26 | < 0.1% |
| 811.21 | 26 | < 0.1% |
| 721.58 | 26 | < 0.1% |
| 418.88 | 25 | < 0.1% |
| 760.97 | 25 | < 0.1% |
| 406.5 | 24 | < 0.1% |
| 784.58 | 24 | < 0.1% |
| 595.87 | 24 | < 0.1% |
| 918 | 24 | < 0.1% |
| 34.86 | 24 | < 0.1% |
| Other values (99985) | 999752 |
| Value | Count | Frequency (%) |
| 236.75 | 5 | < 0.1% |
| 236.37 | 5 | < 0.1% |
| 157.68 | 4 | < 0.1% |
| 240.47 | 4 | < 0.1% |
| 643.15 | 4 | < 0.1% |
| 28.71 | 4 | < 0.1% |
| 557.58 | 4 | < 0.1% |
| 44.76 | 4 | < 0.1% |
| 788.39 | 4 | < 0.1% |
| 234.73 | 4 | < 0.1% |
| Other values (25941) | 29958 |
| Value | Count | Frequency (%) |
| 0 | 6 | |
| 0.01 | 13 | |
| 0.02 | 8 | |
| 0.03 | 8 | |
| 0.04 | 6 |
| Value | Count | Frequency (%) |
| 0.02 | 1 | |
| 0.03 | 1 | |
| 0.05 | 1 | |
| 0.06 | 1 | |
| 0.13 | 1 |
| Value | Count | Frequency (%) |
| 0.02 | 1 | |
| 0.03 | 1 | |
| 0.05 | 1 | |
| 0.06 | 1 | |
| 0.13 | 1 |
| Value | Count | Frequency (%) |
| 0 | 6 | |
| 0.01 | 13 | |
| 0.02 | 8 | |
| 0.03 | 8 | |
| 0.04 | 6 |
avg_spent_per_category
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 98999 | 25850 |
| Distinct (%) | 9.9% | 86.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 505.1754779 | 504.7207147 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| Maximum | 1000 | 1000 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| 5-th percentile | 59.49 | 59.958 |
| Q1 | 257.24 | 257.0825 |
| median | 505.14 | 501.885 |
| Q3 | 753.06 | 753.1025 |
| 95-th percentile | 950.7405 | 951.441 |
| Maximum | 1000 | 1000 |
| Range | 990 | 989.98 |
| Interquartile range (IQR) | 495.82 | 496.02 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 286.0591784 | 286.2765849 |
| Coefficient of variation (CV) | 0.566257055 | 0.5671980099 |
| Kurtosis | -1.201963641 | -1.201876857 |
| Mean | 505.1754779 | 504.7207147 |
| Median Absolute Deviation (MAD) | 247.91 | 247.985 |
| Skewness | -0.0002454959133 | 0.008955884632 |
| Sum | 505175477.9 | 15141621.44 |
| Variance | 81829.85355 | 81954.28307 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 202.69 | 27 | < 0.1% |
| 969.16 | 26 | < 0.1% |
| 582.24 | 26 | < 0.1% |
| 806.1 | 25 | < 0.1% |
| 330.74 | 25 | < 0.1% |
| 798.54 | 25 | < 0.1% |
| 299.29 | 25 | < 0.1% |
| 312.38 | 25 | < 0.1% |
| 825.53 | 24 | < 0.1% |
| 525.28 | 24 | < 0.1% |
| Other values (98989) | 999748 |
| Value | Count | Frequency (%) |
| 174.08 | 5 | < 0.1% |
| 112.13 | 5 | < 0.1% |
| 966.69 | 4 | < 0.1% |
| 866.33 | 4 | < 0.1% |
| 65.43 | 4 | < 0.1% |
| 628.89 | 4 | < 0.1% |
| 35.16 | 4 | < 0.1% |
| 803.28 | 4 | < 0.1% |
| 99.97 | 4 | < 0.1% |
| 344.54 | 4 | < 0.1% |
| Other values (25840) | 29958 |
| Value | Count | Frequency (%) |
| 10 | 4 | < 0.1% |
| 10.01 | 8 | |
| 10.02 | 13 | |
| 10.03 | 10 | |
| 10.04 | 13 |
| Value | Count | Frequency (%) |
| 10.02 | 2 | |
| 10.04 | 1 | |
| 10.05 | 1 | |
| 10.15 | 1 | |
| 10.32 | 1 |
| Value | Count | Frequency (%) |
| 10.02 | 2 | |
| 10.04 | 1 | |
| 10.05 | 1 | |
| 10.15 | 1 | |
| 10.32 | 1 |
| Value | Count | Frequency (%) |
| 10 | 4 | < 0.1% |
| 10.01 | 8 | |
| 10.02 | 13 | |
| 10.03 | 10 | |
| 10.04 | 13 |
max_single_purchase_value
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 99001 | 25798 |
| Distinct (%) | 9.9% | 86.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 505.0014045 | 501.5595277 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10.04 |
| Maximum | 1000 | 999.99 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10 | 10.04 |
| 5-th percentile | 59.3 | 58.76 |
| Q1 | 256.84 | 251.2725 |
| median | 505.22 | 499.845 |
| Q3 | 753.21 | 750.16 |
| 95-th percentile | 950.55 | 951.283 |
| Maximum | 1000 | 999.99 |
| Range | 990 | 989.95 |
| Interquartile range (IQR) | 496.37 | 498.8875 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 286.0733241 | 286.5305105 |
| Coefficient of variation (CV) | 0.5664802545 | 0.5712791698 |
| Kurtosis | -1.202495075 | -1.210325642 |
| Mean | 505.0014045 | 501.5595277 |
| Median Absolute Deviation (MAD) | 248.18 | 249.455 |
| Skewness | -0.0008466890807 | 0.01889523734 |
| Sum | 505001404.5 | 15046785.83 |
| Variance | 81837.94677 | 82099.73347 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 575.57 | 28 | < 0.1% |
| 461.6 | 26 | < 0.1% |
| 874.29 | 25 | < 0.1% |
| 105.78 | 25 | < 0.1% |
| 354.85 | 25 | < 0.1% |
| 736.87 | 25 | < 0.1% |
| 439.72 | 25 | < 0.1% |
| 893.75 | 24 | < 0.1% |
| 179.32 | 24 | < 0.1% |
| 330.94 | 24 | < 0.1% |
| Other values (98991) | 999749 |
| Value | Count | Frequency (%) |
| 261.06 | 5 | < 0.1% |
| 39.12 | 5 | < 0.1% |
| 769.44 | 4 | < 0.1% |
| 99.25 | 4 | < 0.1% |
| 879.12 | 4 | < 0.1% |
| 883.42 | 4 | < 0.1% |
| 507.84 | 4 | < 0.1% |
| 548.53 | 4 | < 0.1% |
| 570.99 | 4 | < 0.1% |
| 668 | 4 | < 0.1% |
| Other values (25788) | 29958 |
| Value | Count | Frequency (%) |
| 10 | 6 | < 0.1% |
| 10.01 | 8 | |
| 10.02 | 15 | |
| 10.03 | 5 | < 0.1% |
| 10.04 | 15 |
| Value | Count | Frequency (%) |
| 10.04 | 1 | |
| 10.08 | 1 | |
| 10.09 | 2 | |
| 10.1 | 1 | |
| 10.11 | 1 |
| Value | Count | Frequency (%) |
| 10.04 | 1 | |
| 10.08 | 1 | |
| 10.09 | 2 | |
| 10.1 | 1 | |
| 10.11 | 1 |
| Value | Count | Frequency (%) |
| 10 | 6 | < 0.1% |
| 10.01 | 8 | |
| 10.02 | 15 | |
| 10.03 | 5 | < 0.1% |
| 10.04 | 15 |
min_single_purchase_value
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 991 | 991 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.04384896 | 5.048166667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| Maximum | 10 | 10 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| 5-th percentile | 0.59 | 0.61 |
| Q1 | 2.57 | 2.57 |
| median | 5.04 | 5.06 |
| Q3 | 7.51 | 7.51 |
| 95-th percentile | 9.5 | 9.47 |
| Maximum | 10 | 10 |
| Range | 9.9 | 9.9 |
| Interquartile range (IQR) | 4.94 | 4.94 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2.855904644 | 2.845148112 |
| Coefficient of variation (CV) | 0.566215338 | 0.5636002732 |
| Kurtosis | -1.198193882 | -1.201234009 |
| Mean | 5.04384896 | 5.048166667 |
| Median Absolute Deviation (MAD) | 2.47 | 2.47 |
| Skewness | 0.002415403507 | -0.00852735842 |
| Sum | 5043848.96 | 151445 |
| Variance | 8.156191335 | 8.094867781 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 4.66 | 1123 | 0.1% |
| 0.29 | 1112 | 0.1% |
| 3.05 | 1110 | 0.1% |
| 1.67 | 1101 | 0.1% |
| 4.67 | 1092 | 0.1% |
| 6.93 | 1092 | 0.1% |
| 6.14 | 1091 | 0.1% |
| 5.19 | 1091 | 0.1% |
| 5.31 | 1086 | 0.1% |
| 5.02 | 1086 | 0.1% |
| Other values (981) | 989016 |
| Value | Count | Frequency (%) |
| 0.19 | 48 | 0.2% |
| 1.31 | 44 | 0.1% |
| 2.78 | 44 | 0.1% |
| 8.09 | 44 | 0.1% |
| 5.02 | 43 | 0.1% |
| 0.97 | 43 | 0.1% |
| 2.2 | 43 | 0.1% |
| 1.06 | 43 | 0.1% |
| 6.16 | 43 | 0.1% |
| 5.81 | 43 | 0.1% |
| Other values (981) | 29562 |
| Value | Count | Frequency (%) |
| 0.1 | 491 | |
| 0.11 | 1041 | |
| 0.12 | 1011 | |
| 0.13 | 1044 | |
| 0.14 | 1013 |
| Value | Count | Frequency (%) |
| 0.1 | 15 | 0.1% |
| 0.11 | 31 | |
| 0.12 | 25 | |
| 0.13 | 38 | |
| 0.14 | 18 |
| Value | Count | Frequency (%) |
| 0.1 | 15 | < 0.1% |
| 0.11 | 31 | |
| 0.12 | 25 | |
| 0.13 | 38 | |
| 0.14 | 18 |
| Value | Count | Frequency (%) |
| 0.1 | 491 | |
| 0.11 | 1041 | |
| 0.12 | 1011 | |
| 0.13 | 1044 | |
| 0.14 | 1013 |
product_name
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 9 | 9 |
| Median length | 9 | 9 |
| Mean length | 9 | 9 |
| Min length | 9 | 9 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Product D | Product D |
| 2nd row | Product C | Product B |
| 3rd row | Product B | Product A |
| 4th row | Product A | Product B |
| 5th row | Product C | Product C |
| Value | Count | Frequency (%) |
| product | 1000000 | |
| b | 250375 | 12.5% |
| c | 249957 | 12.5% |
| a | 249928 | 12.5% |
| d | 249740 | 12.5% |
| Value | Count | Frequency (%) |
| product | 30000 | |
| b | 7575 | 12.6% |
| d | 7556 | 12.6% |
| c | 7441 | 12.4% |
| a | 7428 | 12.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| 1000000 | ||
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| 30000 | ||
| B | 7575 | 2.8% |
| D | 7556 | 2.8% |
| Other values (2) | 14869 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
| Value | Count | Frequency (%) |
| (unknown) | 270000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| 1000000 | ||
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| 30000 | ||
| B | 7575 | 2.8% |
| D | 7556 | 2.8% |
| Other values (2) | 14869 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
| Value | Count | Frequency (%) |
| (unknown) | 270000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| 1000000 | ||
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| 30000 | ||
| B | 7575 | 2.8% |
| D | 7556 | 2.8% |
| Other values (2) | 14869 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
| Value | Count | Frequency (%) |
| (unknown) | 270000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| 1000000 | ||
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| 30000 | ||
| B | 7575 | 2.8% |
| D | 7556 | 2.8% |
| Other values (2) | 14869 |
product_brand
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 7 | 7 |
| Mean length | 7 | 7 |
| Min length | 7 | 7 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Brand Y | Brand Z |
| 2nd row | Brand X | Brand Z |
| 3rd row | Brand X | Brand X |
| 4th row | Brand Z | Brand Y |
| 5th row | Brand X | Brand X |
| Value | Count | Frequency (%) |
| brand | 1000000 | |
| y | 333775 | 16.7% |
| z | 333608 | 16.7% |
| x | 332617 | 16.6% |
| Value | Count | Frequency (%) |
| brand | 30000 | |
| z | 10045 | 16.7% |
| y | 10035 | 16.7% |
| x | 9920 | 16.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| 1000000 | ||
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| 30000 | ||
| Z | 10045 | 4.8% |
| Y | 10035 | 4.8% |
| X | 9920 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| 1000000 | ||
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| 30000 | ||
| Z | 10045 | 4.8% |
| Y | 10035 | 4.8% |
| X | 9920 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| 1000000 | ||
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| 30000 | ||
| Z | 10045 | 4.8% |
| Y | 10035 | 4.8% |
| X | 9920 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| 1000000 | ||
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| 30000 | ||
| Z | 10045 | 4.8% |
| Y | 10035 | 4.8% |
| X | 9920 | 4.7% |
product_rating
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 41 | 41 |
| Distinct (%) | < 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 2.9990096 | 3.000136667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 5 | 5 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1.2 | 1.2 |
| Q1 | 2 | 2 |
| median | 3 | 3 |
| Q3 | 4 | 4 |
| 95-th percentile | 4.8 | 4.8 |
| Maximum | 5 | 5 |
| Range | 4 | 4 |
| Interquartile range (IQR) | 2 | 2 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 1.154800603 | 1.159605021 |
| Coefficient of variation (CV) | 0.3850606557 | 0.3865173989 |
| Kurtosis | -1.196293362 | -1.202777711 |
| Mean | 2.9990096 | 3.000136667 |
| Median Absolute Deviation (MAD) | 1 | 1 |
| Skewness | -0.0005343871929 | -0.005588273582 |
| Sum | 2999009.6 | 90004.1 |
| Variance | 1.333564433 | 1.344683804 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 2.9 | 25242 | 2.5% |
| 3.4 | 25229 | 2.5% |
| 2.6 | 25229 | 2.5% |
| 1.3 | 25194 | 2.5% |
| 3 | 25181 | 2.5% |
| 4.7 | 25166 | 2.5% |
| 4.3 | 25159 | 2.5% |
| 4.1 | 25146 | 2.5% |
| 1.6 | 25141 | 2.5% |
| 4 | 25134 | 2.5% |
| Other values (31) | 748179 |
| Value | Count | Frequency (%) |
| 4.6 | 811 | 2.7% |
| 2 | 796 | 2.7% |
| 1.4 | 783 | 2.6% |
| 4.4 | 780 | 2.6% |
| 3.4 | 780 | 2.6% |
| 1.6 | 770 | 2.6% |
| 3.3 | 768 | 2.6% |
| 3.8 | 768 | 2.6% |
| 4.2 | 767 | 2.6% |
| 2.6 | 765 | 2.5% |
| Other values (31) | 22212 |
| Value | Count | Frequency (%) |
| 1 | 12653 | |
| 1.1 | 24871 | |
| 1.2 | 25095 | |
| 1.3 | 25194 | |
| 1.4 | 24848 |
| Value | Count | Frequency (%) |
| 1 | 404 | |
| 1.1 | 760 | |
| 1.2 | 749 | |
| 1.3 | 755 | |
| 1.4 | 783 |
| Value | Count | Frequency (%) |
| 1 | 404 | |
| 1.1 | 760 | |
| 1.2 | 749 | |
| 1.3 | 755 | |
| 1.4 | 783 |
| Value | Count | Frequency (%) |
| 1 | 12653 | |
| 1.1 | 24871 | |
| 1.2 | 25095 | |
| 1.3 | 25194 | |
| 1.4 | 24848 |
product_review_count
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 1000 | 1000 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499.235198 | 500.4784667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 999 | 999 |
| Zeros | 987 | 40 |
| Zeros (%) | 0.1% | 0.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 50 | 48.95 |
| Q1 | 250 | 250 |
| median | 499 | 502 |
| Q3 | 749 | 753 |
| 95-th percentile | 949 | 951 |
| Maximum | 999 | 999 |
| Range | 999 | 999 |
| Interquartile range (IQR) | 499 | 503 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 288.4461496 | 289.9158218 |
| Coefficient of variation (CV) | 0.5777760678 | 0.5792773138 |
| Kurtosis | -1.19905271 | -1.209343923 |
| Mean | 499.235198 | 500.4784667 |
| Median Absolute Deviation (MAD) | 250 | 251 |
| Skewness | 0.001190014496 | -0.005998671865 |
| Sum | 499235198 | 15014354 |
| Variance | 83201.18122 | 84051.18371 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 974 | 1095 | 0.1% |
| 56 | 1089 | 0.1% |
| 769 | 1089 | 0.1% |
| 725 | 1088 | 0.1% |
| 683 | 1085 | 0.1% |
| 229 | 1082 | 0.1% |
| 501 | 1079 | 0.1% |
| 937 | 1074 | 0.1% |
| 384 | 1073 | 0.1% |
| 497 | 1072 | 0.1% |
| Other values (990) | 989174 |
| Value | Count | Frequency (%) |
| 907 | 51 | 0.2% |
| 704 | 47 | 0.2% |
| 90 | 47 | 0.2% |
| 627 | 46 | 0.2% |
| 884 | 44 | 0.1% |
| 785 | 44 | 0.1% |
| 818 | 44 | 0.1% |
| 328 | 43 | 0.1% |
| 769 | 43 | 0.1% |
| 20 | 43 | 0.1% |
| Other values (990) | 29548 |
| Value | Count | Frequency (%) |
| 0 | 987 | |
| 1 | 999 | |
| 2 | 1006 | |
| 3 | 1006 | |
| 4 | 1027 |
| Value | Count | Frequency (%) |
| 0 | 40 | |
| 1 | 33 | |
| 2 | 31 | |
| 3 | 23 | |
| 4 | 38 |
| Value | Count | Frequency (%) |
| 0 | 40 | |
| 1 | 33 | |
| 2 | 31 | |
| 3 | 23 | |
| 4 | 38 |
| Value | Count | Frequency (%) |
| 0 | 987 | |
| 1 | 999 | |
| 2 | 1006 | |
| 3 | 1006 | |
| 4 | 1027 |
product_stock
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.515129 | 49.56016667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 10174 | 317 |
| Zeros (%) | 1.0% | 1.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 5 | 4 |
| Q1 | 25 | 24 |
| median | 49 | 50 |
| Q3 | 75 | 75 |
| 95-th percentile | 95 | 94 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 50 | 51 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 28.87664529 | 28.97239192 |
| Coefficient of variation (CV) | 0.5831883279 | 0.5845902842 |
| Kurtosis | -1.200520476 | -1.214128381 |
| Mean | 49.515129 | 49.56016667 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 0.0006383736941 | -0.00846355267 |
| Sum | 49515129 | 1486805 |
| Variance | 833.860643 | 839.3994933 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 89 | 10261 | 1.0% |
| 70 | 10245 | 1.0% |
| 60 | 10187 | 1.0% |
| 23 | 10175 | 1.0% |
| 0 | 10174 | 1.0% |
| 54 | 10171 | 1.0% |
| 44 | 10148 | 1.0% |
| 96 | 10147 | 1.0% |
| 32 | 10138 | 1.0% |
| 77 | 10136 | 1.0% |
| Other values (90) | 898218 |
| Value | Count | Frequency (%) |
| 35 | 340 | 1.1% |
| 18 | 330 | 1.1% |
| 5 | 330 | 1.1% |
| 82 | 328 | 1.1% |
| 70 | 327 | 1.1% |
| 50 | 326 | 1.1% |
| 33 | 325 | 1.1% |
| 66 | 324 | 1.1% |
| 71 | 324 | 1.1% |
| 3 | 323 | 1.1% |
| Other values (90) | 26723 |
| Value | Count | Frequency (%) |
| 0 | 10174 | |
| 1 | 9857 | |
| 2 | 9895 | |
| 3 | 10030 | |
| 4 | 9924 |
| Value | Count | Frequency (%) |
| 0 | 317 | |
| 1 | 272 | |
| 2 | 299 | |
| 3 | 323 | |
| 4 | 313 |
| Value | Count | Frequency (%) |
| 0 | 317 | |
| 1 | 272 | |
| 2 | 299 | |
| 3 | 323 | |
| 4 | 313 |
| Value | Count | Frequency (%) |
| 0 | 10174 | |
| 1 | 9857 | |
| 2 | 9895 | |
| 3 | 10030 | |
| 4 | 9924 |
product_return_rate
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 51 | 51 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.25013741 | 0.2500103333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 0.5 | 0.5 |
| Zeros | 9960 | 299 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.03 | 0.02 |
| Q1 | 0.13 | 0.12 |
| median | 0.25 | 0.25 |
| Q3 | 0.38 | 0.38 |
| 95-th percentile | 0.48 | 0.48 |
| Maximum | 0.5 | 0.5 |
| Range | 0.5 | 0.5 |
| Interquartile range (IQR) | 0.25 | 0.26 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 0.1444084896 | 0.144895237 |
| Coefficient of variation (CV) | 0.5773166421 | 0.5795569931 |
| Kurtosis | -1.197824771 | -1.200929524 |
| Mean | 0.25013741 | 0.2500103333 |
| Median Absolute Deviation (MAD) | 0.13 | 0.13 |
| Skewness | -0.0005165569762 | -0.006130410715 |
| Sum | 250137.41 | 7500.31 |
| Variance | 0.02085381187 | 0.02099462971 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0.43 | 20287 | 2.0% |
| 0.38 | 20282 | 2.0% |
| 0.03 | 20242 | 2.0% |
| 0.46 | 20215 | 2.0% |
| 0.4 | 20209 | 2.0% |
| 0.14 | 20164 | 2.0% |
| 0.45 | 20148 | 2.0% |
| 0.16 | 20140 | 2.0% |
| 0.06 | 20135 | 2.0% |
| 0.29 | 20118 | 2.0% |
| Other values (41) | 798060 |
| Value | Count | Frequency (%) |
| 0.23 | 661 | 2.2% |
| 0.28 | 654 | 2.2% |
| 0.01 | 649 | 2.2% |
| 0.08 | 639 | 2.1% |
| 0.44 | 635 | 2.1% |
| 0.33 | 633 | 2.1% |
| 0.21 | 628 | 2.1% |
| 0.42 | 626 | 2.1% |
| 0.41 | 625 | 2.1% |
| 0.03 | 620 | 2.1% |
| Other values (41) | 23630 |
| Value | Count | Frequency (%) |
| 0 | 9960 | |
| 0.01 | 19921 | |
| 0.02 | 19994 | |
| 0.03 | 20242 | |
| 0.04 | 19825 |
| Value | Count | Frequency (%) |
| 0 | 299 | |
| 0.01 | 649 | |
| 0.02 | 618 | |
| 0.03 | 620 | |
| 0.04 | 594 |
| Value | Count | Frequency (%) |
| 0 | 299 | |
| 0.01 | 649 | |
| 0.02 | 618 | |
| 0.03 | 620 | |
| 0.04 | 594 |
| Value | Count | Frequency (%) |
| 0 | 9960 | |
| 0.01 | 19921 | |
| 0.02 | 19994 | |
| 0.03 | 20242 | |
| 0.04 | 19825 |
product_size
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 5 | 5 |
| Mean length | 5.333501 | 5.328933333 |
| Min length | 5 | 5 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Small | Medium |
| 2nd row | Medium | Small |
| 3rd row | Medium | Small |
| 4th row | Large | Large |
| 5th row | Small | Small |
| Value | Count | Frequency (%) |
| large | 333964 | |
| medium | 333501 | |
| small | 332535 |
| Value | Count | Frequency (%) |
| large | 10120 | |
| small | 10012 | |
| medium | 9868 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 667465 | |
| a | 666499 | |
| m | 666036 | |
| l | 665070 | |
| g | 333964 | |
| r | 333964 | |
| L | 333964 | |
| M | 333501 | |
| i | 333501 | |
| d | 333501 | |
| Other values (2) | 666036 |
| Value | Count | Frequency (%) |
| a | 20132 | |
| l | 20024 | |
| e | 19988 | |
| m | 19880 | |
| g | 10120 | |
| r | 10120 | |
| L | 10120 | |
| S | 10012 | |
| M | 9868 | |
| d | 9868 | |
| Other values (2) | 19736 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5333501 |
| Value | Count | Frequency (%) |
| (unknown) | 159868 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 667465 | |
| a | 666499 | |
| m | 666036 | |
| l | 665070 | |
| g | 333964 | |
| r | 333964 | |
| L | 333964 | |
| M | 333501 | |
| i | 333501 | |
| d | 333501 | |
| Other values (2) | 666036 |
| Value | Count | Frequency (%) |
| a | 20132 | |
| l | 20024 | |
| e | 19988 | |
| m | 19880 | |
| g | 10120 | |
| r | 10120 | |
| L | 10120 | |
| S | 10012 | |
| M | 9868 | |
| d | 9868 | |
| Other values (2) | 19736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5333501 |
| Value | Count | Frequency (%) |
| (unknown) | 159868 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 667465 | |
| a | 666499 | |
| m | 666036 | |
| l | 665070 | |
| g | 333964 | |
| r | 333964 | |
| L | 333964 | |
| M | 333501 | |
| i | 333501 | |
| d | 333501 | |
| Other values (2) | 666036 |
| Value | Count | Frequency (%) |
| a | 20132 | |
| l | 20024 | |
| e | 19988 | |
| m | 19880 | |
| g | 10120 | |
| r | 10120 | |
| L | 10120 | |
| S | 10012 | |
| M | 9868 | |
| d | 9868 | |
| Other values (2) | 19736 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5333501 |
| Value | Count | Frequency (%) |
| (unknown) | 159868 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 667465 | |
| a | 666499 | |
| m | 666036 | |
| l | 665070 | |
| g | 333964 | |
| r | 333964 | |
| L | 333964 | |
| M | 333501 | |
| i | 333501 | |
| d | 333501 | |
| Other values (2) | 666036 |
| Value | Count | Frequency (%) |
| a | 20132 | |
| l | 20024 | |
| e | 19988 | |
| m | 19880 | |
| g | 10120 | |
| r | 10120 | |
| L | 10120 | |
| S | 10012 | |
| M | 9868 | |
| d | 9868 | |
| Other values (2) | 19736 |
product_weight
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 991 | 991 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.05437238 | 5.073211 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| Maximum | 10 | 10 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| 5-th percentile | 0.6 | 0.61 |
| Q1 | 2.58 | 2.6 |
| median | 5.06 | 5.09 |
| Q3 | 7.53 | 7.56 |
| 95-th percentile | 9.5 | 9.49 |
| Maximum | 10 | 10 |
| Range | 9.9 | 9.9 |
| Interquartile range (IQR) | 4.95 | 4.96 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 2.857848487 | 2.855194911 |
| Coefficient of variation (CV) | 0.56542104 | 0.5627983757 |
| Kurtosis | -1.200012392 | -1.203524603 |
| Mean | 5.05437238 | 5.073211 |
| Median Absolute Deviation (MAD) | 2.47 | 2.48 |
| Skewness | -0.001975515497 | -0.01206535638 |
| Sum | 5054372.38 | 152196.33 |
| Variance | 8.167297977 | 8.152137977 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 2.51 | 1094 | 0.1% |
| 7.79 | 1092 | 0.1% |
| 3.96 | 1089 | 0.1% |
| 3.55 | 1089 | 0.1% |
| 1.61 | 1088 | 0.1% |
| 5.24 | 1088 | 0.1% |
| 1.74 | 1087 | 0.1% |
| 4.66 | 1085 | 0.1% |
| 3.04 | 1082 | 0.1% |
| 1.22 | 1081 | 0.1% |
| Other values (981) | 989125 |
| Value | Count | Frequency (%) |
| 4.7 | 51 | 0.2% |
| 9.34 | 48 | 0.2% |
| 8.52 | 47 | 0.2% |
| 2.3 | 46 | 0.2% |
| 7.86 | 46 | 0.2% |
| 7.57 | 46 | 0.2% |
| 9.31 | 46 | 0.2% |
| 3.34 | 45 | 0.1% |
| 6.89 | 45 | 0.1% |
| 1.61 | 44 | 0.1% |
| Other values (981) | 29536 |
| Value | Count | Frequency (%) |
| 0.1 | 506 | |
| 0.11 | 1031 | |
| 0.12 | 996 | |
| 0.13 | 1001 | |
| 0.14 | 1007 |
| Value | Count | Frequency (%) |
| 0.1 | 11 | < 0.1% |
| 0.11 | 33 | |
| 0.12 | 38 | |
| 0.13 | 37 | |
| 0.14 | 28 |
| Value | Count | Frequency (%) |
| 0.1 | 11 | < 0.1% |
| 0.11 | 33 | |
| 0.12 | 38 | |
| 0.13 | 37 | |
| 0.14 | 28 |
| Value | Count | Frequency (%) |
| 0.1 | 506 | |
| 0.11 | 1031 | |
| 0.12 | 996 | |
| 0.13 | 1001 | |
| 0.14 | 1007 |
product_color
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 5 | 5 |
| Median length | 5 | 5 |
| Mean length | 4.399397 | 4.3939 |
| Min length | 3 | 3 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Red | Green |
| 2nd row | Blue | Red |
| 3rd row | Green | White |
| 4th row | Blue | Blue |
| 5th row | Red | Green |
| Value | Count | Frequency (%) |
| blue | 200671 | |
| green | 200202 | |
| red | 199966 | |
| black | 199704 | |
| white | 199457 |
| Value | Count | Frequency (%) |
| red | 6070 | |
| blue | 6043 | |
| black | 6007 | |
| green | 5974 | |
| white | 5906 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1000498 | |
| B | 400375 | 9.1% |
| l | 400375 | 9.1% |
| u | 200671 | 4.6% |
| G | 200202 | 4.6% |
| r | 200202 | 4.6% |
| n | 200202 | 4.6% |
| R | 199966 | 4.5% |
| d | 199966 | 4.5% |
| a | 199704 | 4.5% |
| Other values (6) | 1197236 |
| Value | Count | Frequency (%) |
| e | 29967 | |
| B | 12050 | 9.1% |
| l | 12050 | 9.1% |
| R | 6070 | 4.6% |
| d | 6070 | 4.6% |
| u | 6043 | 4.6% |
| a | 6007 | 4.6% |
| c | 6007 | 4.6% |
| k | 6007 | 4.6% |
| G | 5974 | 4.5% |
| Other values (6) | 35572 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4399397 |
| Value | Count | Frequency (%) |
| (unknown) | 131817 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1000498 | |
| B | 400375 | 9.1% |
| l | 400375 | 9.1% |
| u | 200671 | 4.6% |
| G | 200202 | 4.6% |
| r | 200202 | 4.6% |
| n | 200202 | 4.6% |
| R | 199966 | 4.5% |
| d | 199966 | 4.5% |
| a | 199704 | 4.5% |
| Other values (6) | 1197236 |
| Value | Count | Frequency (%) |
| e | 29967 | |
| B | 12050 | 9.1% |
| l | 12050 | 9.1% |
| R | 6070 | 4.6% |
| d | 6070 | 4.6% |
| u | 6043 | 4.6% |
| a | 6007 | 4.6% |
| c | 6007 | 4.6% |
| k | 6007 | 4.6% |
| G | 5974 | 4.5% |
| Other values (6) | 35572 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4399397 |
| Value | Count | Frequency (%) |
| (unknown) | 131817 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1000498 | |
| B | 400375 | 9.1% |
| l | 400375 | 9.1% |
| u | 200671 | 4.6% |
| G | 200202 | 4.6% |
| r | 200202 | 4.6% |
| n | 200202 | 4.6% |
| R | 199966 | 4.5% |
| d | 199966 | 4.5% |
| a | 199704 | 4.5% |
| Other values (6) | 1197236 |
| Value | Count | Frequency (%) |
| e | 29967 | |
| B | 12050 | 9.1% |
| l | 12050 | 9.1% |
| R | 6070 | 4.6% |
| d | 6070 | 4.6% |
| u | 6043 | 4.6% |
| a | 6007 | 4.6% |
| c | 6007 | 4.6% |
| k | 6007 | 4.6% |
| G | 5974 | 4.5% |
| Other values (6) | 35572 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4399397 |
| Value | Count | Frequency (%) |
| (unknown) | 131817 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1000498 | |
| B | 400375 | 9.1% |
| l | 400375 | 9.1% |
| u | 200671 | 4.6% |
| G | 200202 | 4.6% |
| r | 200202 | 4.6% |
| n | 200202 | 4.6% |
| R | 199966 | 4.5% |
| d | 199966 | 4.5% |
| a | 199704 | 4.5% |
| Other values (6) | 1197236 |
| Value | Count | Frequency (%) |
| e | 29967 | |
| B | 12050 | 9.1% |
| l | 12050 | 9.1% |
| R | 6070 | 4.6% |
| d | 6070 | 4.6% |
| u | 6043 | 4.6% |
| a | 6007 | 4.6% |
| c | 6007 | 4.6% |
| k | 6007 | 4.6% |
| G | 5974 | 4.5% |
| Other values (6) | 35572 |
product_material
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 5 | 5 |
| Mean length | 5.25087 | 5.262533333 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Metal | Wood |
| 2nd row | Metal | Wood |
| 3rd row | Plastic | Plastic |
| 4th row | Wood | Glass |
| 5th row | Metal | Plastic |
| Value | Count | Frequency (%) |
| plastic | 250483 | |
| wood | 250096 | |
| metal | 249896 | |
| glass | 249525 |
| Value | Count | Frequency (%) |
| plastic | 7669 | |
| glass | 7513 | |
| wood | 7462 | |
| metal | 7356 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 749904 | |
| a | 749904 | |
| s | 749533 | |
| t | 500379 | |
| o | 500192 | |
| P | 250483 | 4.8% |
| i | 250483 | 4.8% |
| c | 250483 | 4.8% |
| W | 250096 | 4.8% |
| d | 250096 | 4.8% |
| Other values (3) | 749317 |
| Value | Count | Frequency (%) |
| s | 22695 | |
| a | 22538 | |
| l | 22538 | |
| t | 15025 | |
| o | 14924 | |
| P | 7669 | 4.9% |
| i | 7669 | 4.9% |
| c | 7669 | 4.9% |
| G | 7513 | 4.8% |
| W | 7462 | 4.7% |
| Other values (3) | 22174 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5250870 |
| Value | Count | Frequency (%) |
| (unknown) | 157876 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 749904 | |
| a | 749904 | |
| s | 749533 | |
| t | 500379 | |
| o | 500192 | |
| P | 250483 | 4.8% |
| i | 250483 | 4.8% |
| c | 250483 | 4.8% |
| W | 250096 | 4.8% |
| d | 250096 | 4.8% |
| Other values (3) | 749317 |
| Value | Count | Frequency (%) |
| s | 22695 | |
| a | 22538 | |
| l | 22538 | |
| t | 15025 | |
| o | 14924 | |
| P | 7669 | 4.9% |
| i | 7669 | 4.9% |
| c | 7669 | 4.9% |
| G | 7513 | 4.8% |
| W | 7462 | 4.7% |
| Other values (3) | 22174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5250870 |
| Value | Count | Frequency (%) |
| (unknown) | 157876 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 749904 | |
| a | 749904 | |
| s | 749533 | |
| t | 500379 | |
| o | 500192 | |
| P | 250483 | 4.8% |
| i | 250483 | 4.8% |
| c | 250483 | 4.8% |
| W | 250096 | 4.8% |
| d | 250096 | 4.8% |
| Other values (3) | 749317 |
| Value | Count | Frequency (%) |
| s | 22695 | |
| a | 22538 | |
| l | 22538 | |
| t | 15025 | |
| o | 14924 | |
| P | 7669 | 4.9% |
| i | 7669 | 4.9% |
| c | 7669 | 4.9% |
| G | 7513 | 4.8% |
| W | 7462 | 4.7% |
| Other values (3) | 22174 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5250870 |
| Value | Count | Frequency (%) |
| (unknown) | 157876 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 749904 | |
| a | 749904 | |
| s | 749533 | |
| t | 500379 | |
| o | 500192 | |
| P | 250483 | 4.8% |
| i | 250483 | 4.8% |
| c | 250483 | 4.8% |
| W | 250096 | 4.8% |
| d | 250096 | 4.8% |
| Other values (3) | 749317 |
| Value | Count | Frequency (%) |
| s | 22695 | |
| a | 22538 | |
| l | 22538 | |
| t | 15025 | |
| o | 14924 | |
| P | 7669 | 4.9% |
| i | 7669 | 4.9% |
| c | 7669 | 4.9% |
| G | 7513 | 4.8% |
| W | 7462 | 4.7% |
| Other values (3) | 22174 |
product_manufacture_date
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 992037 | 29992 |
| Distinct (%) | 99.2% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 984126 | 29984 ? |
| Unique (%) | 98.4% | 99.9% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | 2019-08-04 01:47:01 | 2019-03-16 10:53:28 |
| 2nd row | 2019-10-23 19:59:17 | 2018-09-16 06:18:11 |
| 3rd row | 2018-05-12 08:00:29 | 2018-12-23 10:31:52 |
| 4th row | 2019-11-15 16:17:29 | 2019-10-23 08:43:30 |
| 5th row | 2019-08-27 02:58:19 | 2019-01-02 12:03:08 |
| Value | Count | Frequency (%) |
| 2018-04-10 | 1514 | 0.1% |
| 2019-03-19 | 1490 | 0.1% |
| 2018-02-26 | 1471 | 0.1% |
| 2018-06-18 | 1467 | 0.1% |
| 2018-09-24 | 1462 | 0.1% |
| 2019-01-25 | 1457 | 0.1% |
| 2019-01-30 | 1456 | 0.1% |
| 2018-04-28 | 1454 | 0.1% |
| 2019-07-04 | 1453 | 0.1% |
| 2019-01-09 | 1453 | 0.1% |
| Other values (87119) | 1985323 |
| Value | Count | Frequency (%) |
| 2019-09-18 | 61 | 0.1% |
| 2018-02-06 | 59 | 0.1% |
| 2018-12-13 | 59 | 0.1% |
| 2019-02-03 | 58 | 0.1% |
| 2019-01-20 | 55 | 0.1% |
| 2019-02-14 | 55 | 0.1% |
| 2018-10-01 | 55 | 0.1% |
| 2018-04-10 | 54 | 0.1% |
| 2019-02-01 | 54 | 0.1% |
| 2019-12-28 | 54 | 0.1% |
| Other values (25987) | 59436 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3297851 | |
| 1 | 2942646 | |
| 2 | 2411227 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 8 | 967248 | 5.1% |
| 9 | 961589 | 5.1% |
| 3 | 891404 | 4.7% |
| 5 | 799916 | 4.2% |
| Other values (3) | 1728119 |
| Value | Count | Frequency (%) |
| 0 | 99034 | |
| 1 | 88344 | |
| 2 | 72210 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 8 | 29035 | 5.1% |
| 9 | 28774 | 5.0% |
| 3 | 27023 | 4.7% |
| 4 | 24155 | 4.2% |
| Other values (3) | 51425 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3297851 | |
| 1 | 2942646 | |
| 2 | 2411227 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 8 | 967248 | 5.1% |
| 9 | 961589 | 5.1% |
| 3 | 891404 | 4.7% |
| 5 | 799916 | 4.2% |
| Other values (3) | 1728119 |
| Value | Count | Frequency (%) |
| 0 | 99034 | |
| 1 | 88344 | |
| 2 | 72210 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 8 | 29035 | 5.1% |
| 9 | 28774 | 5.0% |
| 3 | 27023 | 4.7% |
| 4 | 24155 | 4.2% |
| Other values (3) | 51425 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3297851 | |
| 1 | 2942646 | |
| 2 | 2411227 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 8 | 967248 | 5.1% |
| 9 | 961589 | 5.1% |
| 3 | 891404 | 4.7% |
| 5 | 799916 | 4.2% |
| Other values (3) | 1728119 |
| Value | Count | Frequency (%) |
| 0 | 99034 | |
| 1 | 88344 | |
| 2 | 72210 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 8 | 29035 | 5.1% |
| 9 | 28774 | 5.0% |
| 3 | 27023 | 4.7% |
| 4 | 24155 | 4.2% |
| Other values (3) | 51425 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3297851 | |
| 1 | 2942646 | |
| 2 | 2411227 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 8 | 967248 | 5.1% |
| 9 | 961589 | 5.1% |
| 3 | 891404 | 4.7% |
| 5 | 799916 | 4.2% |
| Other values (3) | 1728119 |
| Value | Count | Frequency (%) |
| 0 | 99034 | |
| 1 | 88344 | |
| 2 | 72210 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 8 | 29035 | 5.1% |
| 9 | 28774 | 5.0% |
| 3 | 27023 | 4.7% |
| 4 | 24155 | 4.2% |
| Other values (3) | 51425 |
product_expiry_date
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 992042 | 29990 |
| Distinct (%) | 99.2% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 984121 | 29980 ? |
| Unique (%) | 98.4% | 99.9% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | 2022-05-28 14:54:02 | 2022-01-03 18:34:16 |
| 2nd row | 2022-12-19 08:04:41 | 2023-12-15 07:55:54 |
| 3rd row | 2023-02-01 12:15:07 | 2023-01-10 02:39:45 |
| 4th row | 2023-02-05 11:46:57 | 2022-08-09 14:28:49 |
| 5th row | 2023-10-05 08:13:07 | 2022-11-19 06:40:08 |
| Value | Count | Frequency (%) |
| 2022-12-22 | 1476 | 0.1% |
| 2022-06-08 | 1475 | 0.1% |
| 2022-01-28 | 1473 | 0.1% |
| 2023-03-13 | 1472 | 0.1% |
| 2022-10-06 | 1468 | 0.1% |
| 2022-06-27 | 1468 | 0.1% |
| 2023-07-08 | 1457 | 0.1% |
| 2023-07-23 | 1457 | 0.1% |
| 2023-01-11 | 1452 | 0.1% |
| 2022-04-17 | 1450 | 0.1% |
| Other values (87119) | 1985352 |
| Value | Count | Frequency (%) |
| 2022-04-24 | 68 | 0.1% |
| 2023-11-08 | 64 | 0.1% |
| 2023-10-01 | 63 | 0.1% |
| 2022-07-14 | 61 | 0.1% |
| 2022-05-05 | 57 | 0.1% |
| 2023-06-01 | 57 | 0.1% |
| 2022-08-22 | 56 | 0.1% |
| 2022-05-27 | 56 | 0.1% |
| 2023-12-09 | 56 | 0.1% |
| 2023-04-30 | 56 | 0.1% |
| Other values (26089) | 59406 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3911569 | |
| 0 | 3298912 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1939371 | |
| 3 | 1392537 | 7.3% |
| 1000000 | 5.3% | |
| 5 | 800253 | 4.2% |
| 4 | 798405 | 4.2% |
| 8 | 467536 | 2.5% |
| Other values (3) | 1391417 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 117113 | |
| 0 | 98988 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58442 | |
| 3 | 41845 | 7.3% |
| 30000 | 5.3% | |
| 5 | 24111 | 4.2% |
| 4 | 23724 | 4.2% |
| 6 | 14101 | 2.5% |
| Other values (3) | 41676 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3911569 | |
| 0 | 3298912 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1939371 | |
| 3 | 1392537 | 7.3% |
| 1000000 | 5.3% | |
| 5 | 800253 | 4.2% |
| 4 | 798405 | 4.2% |
| 8 | 467536 | 2.5% |
| Other values (3) | 1391417 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 117113 | |
| 0 | 98988 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58442 | |
| 3 | 41845 | 7.3% |
| 30000 | 5.3% | |
| 5 | 24111 | 4.2% |
| 4 | 23724 | 4.2% |
| 6 | 14101 | 2.5% |
| Other values (3) | 41676 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3911569 | |
| 0 | 3298912 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1939371 | |
| 3 | 1392537 | 7.3% |
| 1000000 | 5.3% | |
| 5 | 800253 | 4.2% |
| 4 | 798405 | 4.2% |
| 8 | 467536 | 2.5% |
| Other values (3) | 1391417 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 117113 | |
| 0 | 98988 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58442 | |
| 3 | 41845 | 7.3% |
| 30000 | 5.3% | |
| 5 | 24111 | 4.2% |
| 4 | 23724 | 4.2% |
| 6 | 14101 | 2.5% |
| Other values (3) | 41676 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3911569 | |
| 0 | 3298912 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1939371 | |
| 3 | 1392537 | 7.3% |
| 1000000 | 5.3% | |
| 5 | 800253 | 4.2% |
| 4 | 798405 | 4.2% |
| 8 | 467536 | 2.5% |
| Other values (3) | 1391417 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 117113 | |
| 0 | 98988 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58442 | |
| 3 | 41845 | 7.3% |
| 30000 | 5.3% | |
| 5 | 24111 | 4.2% |
| 4 | 23724 | 4.2% |
| 6 | 14101 | 2.5% |
| Other values (3) | 41676 | 7.3% |
product_shelf_life
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 365 | 365 |
| Distinct (%) | < 0.1% | 1.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 181.876207 | 181.6765667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 364 | 364 |
| Zeros | 2713 | 93 |
| Zeros (%) | 0.3% | 0.3% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 18 | 18 |
| Q1 | 91 | 91 |
| median | 182 | 182 |
| Q3 | 273 | 273 |
| 95-th percentile | 346 | 346 |
| Maximum | 364 | 364 |
| Range | 364 | 364 |
| Interquartile range (IQR) | 182 | 182 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 105.2288552 | 105.2236841 |
| Coefficient of variation (CV) | 0.5785740585 | 0.5791813772 |
| Kurtosis | -1.198082782 | -1.198336941 |
| Mean | 181.876207 | 181.6765667 |
| Median Absolute Deviation (MAD) | 91 | 91 |
| Skewness | 0.0006229204449 | 3.395146203 × 10-6 |
| Sum | 181876207 | 5450297 |
| Variance | 11073.11197 | 11072.02369 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 87 | 2893 | 0.3% |
| 272 | 2874 | 0.3% |
| 70 | 2870 | 0.3% |
| 250 | 2870 | 0.3% |
| 210 | 2862 | 0.3% |
| 224 | 2859 | 0.3% |
| 238 | 2857 | 0.3% |
| 33 | 2848 | 0.3% |
| 297 | 2847 | 0.3% |
| 171 | 2845 | 0.3% |
| Other values (355) | 971375 |
| Value | Count | Frequency (%) |
| 95 | 111 | 0.4% |
| 196 | 102 | 0.3% |
| 282 | 100 | 0.3% |
| 142 | 100 | 0.3% |
| 263 | 100 | 0.3% |
| 304 | 100 | 0.3% |
| 169 | 99 | 0.3% |
| 284 | 99 | 0.3% |
| 74 | 98 | 0.3% |
| 92 | 98 | 0.3% |
| Other values (355) | 28993 |
| Value | Count | Frequency (%) |
| 0 | 2713 | |
| 1 | 2788 | |
| 2 | 2776 | |
| 3 | 2725 | |
| 4 | 2788 |
| Value | Count | Frequency (%) |
| 0 | 93 | |
| 1 | 88 | |
| 2 | 88 | |
| 3 | 80 | |
| 4 | 89 |
| Value | Count | Frequency (%) |
| 0 | 93 | |
| 1 | 88 | |
| 2 | 88 | |
| 3 | 80 | |
| 4 | 89 |
| Value | Count | Frequency (%) |
| 0 | 2713 | |
| 1 | 2788 | |
| 2 | 2776 | |
| 3 | 2725 | |
| 4 | 2788 |
promotion_id
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 999 | 999 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499.920037 | 500.8484333 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 999 | 999 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 50 | 50 |
| Q1 | 250 | 253 |
| median | 500 | 503.5 |
| Q3 | 750 | 748 |
| 95-th percentile | 949 | 949 |
| Maximum | 999 | 999 |
| Range | 998 | 998 |
| Interquartile range (IQR) | 500 | 495 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 288.4530565 | 287.71409 |
| Coefficient of variation (CV) | 0.57699839 | 0.5744534091 |
| Kurtosis | -1.200677574 | -1.19046795 |
| Mean | 499.920037 | 500.8484333 |
| Median Absolute Deviation (MAD) | 250 | 247.5 |
| Skewness | -0.0008935044332 | -0.004262817372 |
| Sum | 499920037 | 15025453 |
| Variance | 83205.16579 | 82779.39757 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 52 | 1092 | 0.1% |
| 94 | 1082 | 0.1% |
| 374 | 1079 | 0.1% |
| 117 | 1077 | 0.1% |
| 29 | 1075 | 0.1% |
| 603 | 1075 | 0.1% |
| 512 | 1073 | 0.1% |
| 949 | 1073 | 0.1% |
| 885 | 1073 | 0.1% |
| 51 | 1070 | 0.1% |
| Other values (989) | 989231 |
| Value | Count | Frequency (%) |
| 611 | 50 | 0.2% |
| 305 | 47 | 0.2% |
| 339 | 46 | 0.2% |
| 949 | 45 | 0.1% |
| 419 | 45 | 0.1% |
| 240 | 45 | 0.1% |
| 11 | 44 | 0.1% |
| 525 | 44 | 0.1% |
| 211 | 44 | 0.1% |
| 203 | 44 | 0.1% |
| Other values (989) | 29546 |
| Value | Count | Frequency (%) |
| 1 | 1033 | |
| 2 | 995 | |
| 3 | 1036 | |
| 4 | 1024 | |
| 5 | 992 |
| Value | Count | Frequency (%) |
| 1 | 34 | |
| 2 | 31 | |
| 3 | 44 | |
| 4 | 34 | |
| 5 | 17 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 34 | |
| 2 | 31 | |
| 3 | 44 | |
| 4 | 34 | |
| 5 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1033 | |
| 2 | 995 | |
| 3 | 1036 | |
| 4 | 1024 | |
| 5 | 992 |
promotion_type
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 20 | 20 |
| Median length | 10 | 10 |
| Mean length | 12.334064 | 12.35276667 |
| Min length | 7 | 7 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | 20% Off | Buy One Get One Free |
| 2nd row | Flash Sale | 20% Off |
| 3rd row | Flash Sale | 20% Off |
| 4th row | Buy One Get One Free | Flash Sale |
| 5th row | Flash Sale | 20% Off |
| Value | Count | Frequency (%) |
| one | 667040 | |
| 20 | 333712 | |
| off | 333712 | |
| buy | 333520 | |
| get | 333520 | |
| free | 333520 | |
| flash | 332768 | |
| sale | 332768 |
| Value | Count | Frequency (%) |
| one | 20110 | |
| buy | 10055 | |
| get | 10055 | |
| free | 10055 | |
| 20 | 9989 | |
| off | 9989 | |
| flash | 9956 | |
| sale | 9956 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2000560 | ||
| e | 2000368 | |
| O | 1000752 | 8.1% |
| f | 667424 | 5.4% |
| n | 667040 | 5.4% |
| F | 666288 | 5.4% |
| a | 665536 | 5.4% |
| l | 665536 | 5.4% |
| % | 333712 | 2.7% |
| 0 | 333712 | 2.7% |
| Other values (10) | 3333136 |
| Value | Count | Frequency (%) |
| e | 60231 | |
| 60165 | ||
| O | 30099 | 8.1% |
| n | 20110 | 5.4% |
| F | 20011 | 5.4% |
| f | 19978 | 5.4% |
| a | 19912 | 5.4% |
| l | 19912 | 5.4% |
| y | 10055 | 2.7% |
| u | 10055 | 2.7% |
| Other values (10) | 100055 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12334064 |
| Value | Count | Frequency (%) |
| (unknown) | 370583 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2000560 | ||
| e | 2000368 | |
| O | 1000752 | 8.1% |
| f | 667424 | 5.4% |
| n | 667040 | 5.4% |
| F | 666288 | 5.4% |
| a | 665536 | 5.4% |
| l | 665536 | 5.4% |
| % | 333712 | 2.7% |
| 0 | 333712 | 2.7% |
| Other values (10) | 3333136 |
| Value | Count | Frequency (%) |
| e | 60231 | |
| 60165 | ||
| O | 30099 | 8.1% |
| n | 20110 | 5.4% |
| F | 20011 | 5.4% |
| f | 19978 | 5.4% |
| a | 19912 | 5.4% |
| l | 19912 | 5.4% |
| y | 10055 | 2.7% |
| u | 10055 | 2.7% |
| Other values (10) | 100055 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12334064 |
| Value | Count | Frequency (%) |
| (unknown) | 370583 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2000560 | ||
| e | 2000368 | |
| O | 1000752 | 8.1% |
| f | 667424 | 5.4% |
| n | 667040 | 5.4% |
| F | 666288 | 5.4% |
| a | 665536 | 5.4% |
| l | 665536 | 5.4% |
| % | 333712 | 2.7% |
| 0 | 333712 | 2.7% |
| Other values (10) | 3333136 |
| Value | Count | Frequency (%) |
| e | 60231 | |
| 60165 | ||
| O | 30099 | 8.1% |
| n | 20110 | 5.4% |
| F | 20011 | 5.4% |
| f | 19978 | 5.4% |
| a | 19912 | 5.4% |
| l | 19912 | 5.4% |
| y | 10055 | 2.7% |
| u | 10055 | 2.7% |
| Other values (10) | 100055 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12334064 |
| Value | Count | Frequency (%) |
| (unknown) | 370583 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2000560 | ||
| e | 2000368 | |
| O | 1000752 | 8.1% |
| f | 667424 | 5.4% |
| n | 667040 | 5.4% |
| F | 666288 | 5.4% |
| a | 665536 | 5.4% |
| l | 665536 | 5.4% |
| % | 333712 | 2.7% |
| 0 | 333712 | 2.7% |
| Other values (10) | 3333136 |
| Value | Count | Frequency (%) |
| e | 60231 | |
| 60165 | ||
| O | 30099 | 8.1% |
| n | 20110 | 5.4% |
| F | 20011 | 5.4% |
| f | 19978 | 5.4% |
| a | 19912 | 5.4% |
| l | 19912 | 5.4% |
| y | 10055 | 2.7% |
| u | 10055 | 2.7% |
| Other values (10) | 100055 |
promotion_start_date
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 984258 | 29981 |
| Distinct (%) | 98.4% | 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 968681 | 29962 ? |
| Unique (%) | 96.9% | 99.9% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | 2021-07-14 14:28:42 | 2021-10-25 19:14:52 |
| 2nd row | 2021-09-23 04:26:09 | 2021-04-19 09:43:32 |
| 3rd row | 2021-06-13 12:31:15 | 2021-09-11 21:37:21 |
| 4th row | 2021-05-23 05:42:48 | 2021-10-20 06:59:40 |
| 5th row | 2021-04-19 04:55:32 | 2021-11-29 22:29:55 |
| Value | Count | Frequency (%) |
| 2021-03-05 | 2885 | 0.1% |
| 2021-02-07 | 2874 | 0.1% |
| 2021-06-23 | 2871 | 0.1% |
| 2021-05-15 | 2867 | 0.1% |
| 2021-08-27 | 2863 | 0.1% |
| 2021-11-04 | 2862 | 0.1% |
| 2021-03-25 | 2858 | 0.1% |
| 2021-09-06 | 2854 | 0.1% |
| 2021-12-21 | 2851 | 0.1% |
| 2021-08-06 | 2850 | 0.1% |
| Other values (86754) | 1971365 |
| Value | Count | Frequency (%) |
| 2021-09-19 | 116 | 0.2% |
| 2021-12-10 | 111 | 0.2% |
| 2021-12-01 | 107 | 0.2% |
| 2021-04-13 | 106 | 0.2% |
| 2021-05-20 | 106 | 0.2% |
| 2021-08-27 | 104 | 0.2% |
| 2021-06-26 | 103 | 0.2% |
| 2021-10-22 | 102 | 0.2% |
| 2021-06-27 | 101 | 0.2% |
| 2021-08-15 | 101 | 0.2% |
| Other values (25739) | 58943 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3411867 | |
| 0 | 3300257 | |
| 1 | 2938965 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890896 | 4.7% |
| 5 | 800172 | 4.2% |
| 4 | 796605 | 4.2% |
| 7 | 468643 | 2.5% |
| Other values (3) | 1392595 |
| Value | Count | Frequency (%) |
| 2 | 102720 | |
| 0 | 99050 | |
| 1 | 87808 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 27059 | 4.7% |
| 5 | 23833 | 4.2% |
| 4 | 23806 | 4.2% |
| 8 | 14029 | 2.5% |
| Other values (3) | 41695 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411867 | |
| 0 | 3300257 | |
| 1 | 2938965 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890896 | 4.7% |
| 5 | 800172 | 4.2% |
| 4 | 796605 | 4.2% |
| 7 | 468643 | 2.5% |
| Other values (3) | 1392595 |
| Value | Count | Frequency (%) |
| 2 | 102720 | |
| 0 | 99050 | |
| 1 | 87808 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 27059 | 4.7% |
| 5 | 23833 | 4.2% |
| 4 | 23806 | 4.2% |
| 8 | 14029 | 2.5% |
| Other values (3) | 41695 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411867 | |
| 0 | 3300257 | |
| 1 | 2938965 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890896 | 4.7% |
| 5 | 800172 | 4.2% |
| 4 | 796605 | 4.2% |
| 7 | 468643 | 2.5% |
| Other values (3) | 1392595 |
| Value | Count | Frequency (%) |
| 2 | 102720 | |
| 0 | 99050 | |
| 1 | 87808 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 27059 | 4.7% |
| 5 | 23833 | 4.2% |
| 4 | 23806 | 4.2% |
| 8 | 14029 | 2.5% |
| Other values (3) | 41695 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411867 | |
| 0 | 3300257 | |
| 1 | 2938965 | |
| - | 2000000 | |
| : | 2000000 | |
| 1000000 | 5.3% | |
| 3 | 890896 | 4.7% |
| 5 | 800172 | 4.2% |
| 4 | 796605 | 4.2% |
| 7 | 468643 | 2.5% |
| Other values (3) | 1392595 |
| Value | Count | Frequency (%) |
| 2 | 102720 | |
| 0 | 99050 | |
| 1 | 87808 | |
| - | 60000 | |
| : | 60000 | |
| 30000 | 5.3% | |
| 3 | 27059 | 4.7% |
| 5 | 23833 | 4.2% |
| 4 | 23806 | 4.2% |
| 8 | 14029 | 2.5% |
| Other values (3) | 41695 |
promotion_end_date
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 984252 | 29988 |
| Distinct (%) | 98.4% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 968676 | 29976 ? |
| Unique (%) | 96.9% | 99.9% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | 2022-12-30 13:04:13 | 2022-09-18 10:14:41 |
| 2nd row | 2022-09-13 03:16:26 | 2022-02-20 19:03:50 |
| 3rd row | 2022-03-13 00:53:35 | 2022-06-30 21:32:34 |
| 4th row | 2022-02-06 00:42:30 | 2022-05-11 20:34:34 |
| 5th row | 2022-12-04 13:07:09 | 2022-02-13 01:03:21 |
| Value | Count | Frequency (%) |
| 2022-08-06 | 2905 | 0.1% |
| 2022-03-08 | 2896 | 0.1% |
| 2022-09-28 | 2874 | 0.1% |
| 2022-02-22 | 2873 | 0.1% |
| 2022-09-16 | 2872 | 0.1% |
| 2022-06-10 | 2865 | 0.1% |
| 2022-07-13 | 2858 | 0.1% |
| 2022-12-15 | 2854 | 0.1% |
| 2022-12-31 | 2847 | 0.1% |
| 2022-08-25 | 2842 | 0.1% |
| Other values (86755) | 1971314 |
| Value | Count | Frequency (%) |
| 2022-01-15 | 111 | 0.2% |
| 2022-09-29 | 109 | 0.2% |
| 2022-07-18 | 109 | 0.2% |
| 2022-07-15 | 104 | 0.2% |
| 2022-05-04 | 103 | 0.2% |
| 2022-09-18 | 102 | 0.2% |
| 2022-10-04 | 102 | 0.2% |
| 2022-03-30 | 100 | 0.2% |
| 2022-02-04 | 100 | 0.2% |
| 2022-03-17 | 99 | 0.2% |
| Other values (25838) | 58961 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4411397 | |
| 0 | 3299242 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1940750 | |
| 1000000 | 5.3% | |
| 3 | 890892 | 4.7% |
| 5 | 800852 | 4.2% |
| 4 | 797493 | 4.2% |
| 8 | 467852 | 2.5% |
| Other values (3) | 1391522 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 132124 | |
| 0 | 99096 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58434 | |
| 30000 | 5.3% | |
| 3 | 26803 | 4.7% |
| 5 | 24161 | 4.2% |
| 4 | 23859 | 4.2% |
| 8 | 14080 | 2.5% |
| Other values (3) | 41443 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 4411397 | |
| 0 | 3299242 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1940750 | |
| 1000000 | 5.3% | |
| 3 | 890892 | 4.7% |
| 5 | 800852 | 4.2% |
| 4 | 797493 | 4.2% |
| 8 | 467852 | 2.5% |
| Other values (3) | 1391522 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 132124 | |
| 0 | 99096 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58434 | |
| 30000 | 5.3% | |
| 3 | 26803 | 4.7% |
| 5 | 24161 | 4.2% |
| 4 | 23859 | 4.2% |
| 8 | 14080 | 2.5% |
| Other values (3) | 41443 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 4411397 | |
| 0 | 3299242 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1940750 | |
| 1000000 | 5.3% | |
| 3 | 890892 | 4.7% |
| 5 | 800852 | 4.2% |
| 4 | 797493 | 4.2% |
| 8 | 467852 | 2.5% |
| Other values (3) | 1391522 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 132124 | |
| 0 | 99096 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58434 | |
| 30000 | 5.3% | |
| 3 | 26803 | 4.7% |
| 5 | 24161 | 4.2% |
| 4 | 23859 | 4.2% |
| 8 | 14080 | 2.5% |
| Other values (3) | 41443 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 4411397 | |
| 0 | 3299242 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1940750 | |
| 1000000 | 5.3% | |
| 3 | 890892 | 4.7% |
| 5 | 800852 | 4.2% |
| 4 | 797493 | 4.2% |
| 8 | 467852 | 2.5% |
| Other values (3) | 1391522 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 132124 | |
| 0 | 99096 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58434 | |
| 30000 | 5.3% | |
| 3 | 26803 | 4.7% |
| 5 | 24161 | 4.2% |
| 4 | 23859 | 4.2% |
| 8 | 14080 | 2.5% |
| Other values (3) | 41443 | 7.3% |
promotion_effectiveness
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.333407 | 4.338333333 |
| Min length | 3 | 3 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | High | High |
| 2nd row | Low | High |
| 3rd row | Low | High |
| 4th row | High | High |
| 5th row | Medium | Medium |
| Value | Count | Frequency (%) |
| high | 333660 | |
| medium | 333249 | |
| low | 333091 |
| Value | Count | Frequency (%) |
| high | 10171 | |
| medium | 9993 | |
| low | 9836 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 666909 | |
| H | 333660 | |
| g | 333660 | |
| h | 333660 | |
| M | 333249 | |
| e | 333249 | |
| d | 333249 | |
| u | 333249 | |
| m | 333249 | |
| L | 333091 | |
| Other values (2) | 666182 |
| Value | Count | Frequency (%) |
| i | 20164 | |
| H | 10171 | |
| g | 10171 | |
| h | 10171 | |
| M | 9993 | |
| e | 9993 | |
| d | 9993 | |
| u | 9993 | |
| m | 9993 | |
| L | 9836 | |
| Other values (2) | 19672 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4333407 |
| Value | Count | Frequency (%) |
| (unknown) | 130150 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 666909 | |
| H | 333660 | |
| g | 333660 | |
| h | 333660 | |
| M | 333249 | |
| e | 333249 | |
| d | 333249 | |
| u | 333249 | |
| m | 333249 | |
| L | 333091 | |
| Other values (2) | 666182 |
| Value | Count | Frequency (%) |
| i | 20164 | |
| H | 10171 | |
| g | 10171 | |
| h | 10171 | |
| M | 9993 | |
| e | 9993 | |
| d | 9993 | |
| u | 9993 | |
| m | 9993 | |
| L | 9836 | |
| Other values (2) | 19672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4333407 |
| Value | Count | Frequency (%) |
| (unknown) | 130150 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 666909 | |
| H | 333660 | |
| g | 333660 | |
| h | 333660 | |
| M | 333249 | |
| e | 333249 | |
| d | 333249 | |
| u | 333249 | |
| m | 333249 | |
| L | 333091 | |
| Other values (2) | 666182 |
| Value | Count | Frequency (%) |
| i | 20164 | |
| H | 10171 | |
| g | 10171 | |
| h | 10171 | |
| M | 9993 | |
| e | 9993 | |
| d | 9993 | |
| u | 9993 | |
| m | 9993 | |
| L | 9836 | |
| Other values (2) | 19672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4333407 |
| Value | Count | Frequency (%) |
| (unknown) | 130150 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 666909 | |
| H | 333660 | |
| g | 333660 | |
| h | 333660 | |
| M | 333249 | |
| e | 333249 | |
| d | 333249 | |
| u | 333249 | |
| m | 333249 | |
| L | 333091 | |
| Other values (2) | 666182 |
| Value | Count | Frequency (%) |
| i | 20164 | |
| H | 10171 | |
| g | 10171 | |
| h | 10171 | |
| M | 9993 | |
| e | 9993 | |
| d | 9993 | |
| u | 9993 | |
| m | 9993 | |
| L | 9836 | |
| Other values (2) | 19672 |
promotion_channel
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 12 | 12 |
| Median length | 8 | 8 |
| Mean length | 8.665428 | 8.667266667 |
| Min length | 6 | 6 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Online | Social Media |
| 2nd row | Social Media | Online |
| 3rd row | Online | Social Media |
| 4th row | Social Media | Social Media |
| 5th row | Online | In-store |
| Value | Count | Frequency (%) |
| online | 333694 | |
| social | 333204 | |
| media | 333204 | |
| in-store | 333102 |
| Value | Count | Frequency (%) |
| in-store | 10060 | |
| social | 9983 | |
| media | 9983 | |
| online | 9957 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1000490 | |
| i | 1000102 | |
| e | 1000000 | |
| l | 666898 | 7.7% |
| a | 666408 | 7.7% |
| o | 666306 | 7.7% |
| O | 333694 | 3.9% |
| S | 333204 | 3.8% |
| c | 333204 | 3.8% |
| 333204 | 3.8% | |
| Other values (7) | 2331918 |
| Value | Count | Frequency (%) |
| e | 30000 | |
| n | 29974 | |
| i | 29923 | |
| o | 20043 | 7.7% |
| a | 19966 | 7.7% |
| l | 19940 | 7.7% |
| I | 10060 | 3.9% |
| t | 10060 | 3.9% |
| - | 10060 | 3.9% |
| s | 10060 | 3.9% |
| Other values (7) | 69932 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8665428 |
| Value | Count | Frequency (%) |
| (unknown) | 260018 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 1000490 | |
| i | 1000102 | |
| e | 1000000 | |
| l | 666898 | 7.7% |
| a | 666408 | 7.7% |
| o | 666306 | 7.7% |
| O | 333694 | 3.9% |
| S | 333204 | 3.8% |
| c | 333204 | 3.8% |
| 333204 | 3.8% | |
| Other values (7) | 2331918 |
| Value | Count | Frequency (%) |
| e | 30000 | |
| n | 29974 | |
| i | 29923 | |
| o | 20043 | 7.7% |
| a | 19966 | 7.7% |
| l | 19940 | 7.7% |
| I | 10060 | 3.9% |
| t | 10060 | 3.9% |
| - | 10060 | 3.9% |
| s | 10060 | 3.9% |
| Other values (7) | 69932 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8665428 |
| Value | Count | Frequency (%) |
| (unknown) | 260018 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 1000490 | |
| i | 1000102 | |
| e | 1000000 | |
| l | 666898 | 7.7% |
| a | 666408 | 7.7% |
| o | 666306 | 7.7% |
| O | 333694 | 3.9% |
| S | 333204 | 3.8% |
| c | 333204 | 3.8% |
| 333204 | 3.8% | |
| Other values (7) | 2331918 |
| Value | Count | Frequency (%) |
| e | 30000 | |
| n | 29974 | |
| i | 29923 | |
| o | 20043 | 7.7% |
| a | 19966 | 7.7% |
| l | 19940 | 7.7% |
| I | 10060 | 3.9% |
| t | 10060 | 3.9% |
| - | 10060 | 3.9% |
| s | 10060 | 3.9% |
| Other values (7) | 69932 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8665428 |
| Value | Count | Frequency (%) |
| (unknown) | 260018 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 1000490 | |
| i | 1000102 | |
| e | 1000000 | |
| l | 666898 | 7.7% |
| a | 666408 | 7.7% |
| o | 666306 | 7.7% |
| O | 333694 | 3.9% |
| S | 333204 | 3.8% |
| c | 333204 | 3.8% |
| 333204 | 3.8% | |
| Other values (7) | 2331918 |
| Value | Count | Frequency (%) |
| e | 30000 | |
| n | 29974 | |
| i | 29923 | |
| o | 20043 | 7.7% |
| a | 19966 | 7.7% |
| l | 19940 | 7.7% |
| I | 10060 | 3.9% |
| t | 10060 | 3.9% |
| - | 10060 | 3.9% |
| s | 10060 | 3.9% |
| Other values (7) | 69932 |
promotion_target_audience
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 13 | 19 |
| Mean length | 15.999292 | 16.0068 |
| Min length | 13 | 13 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | New Customers | Returning Customers |
| 2nd row | New Customers | Returning Customers |
| 3rd row | New Customers | Returning Customers |
| 4th row | Returning Customers | New Customers |
| 5th row | New Customers | Returning Customers |
| Value | Count | Frequency (%) |
| customers | 1000000 | |
| new | 500118 | |
| returning | 499882 |
| Value | Count | Frequency (%) |
| customers | 30000 | |
| returning | 15034 | |
| new | 14966 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2000000 | |
| s | 2000000 | |
| u | 1499882 | |
| r | 1499882 | |
| t | 1499882 | |
| C | 1000000 | |
| 1000000 | ||
| o | 1000000 | |
| m | 1000000 | |
| n | 999764 | 6.2% |
| Other values (5) | 2499882 |
| Value | Count | Frequency (%) |
| e | 60000 | |
| s | 60000 | |
| t | 45034 | |
| r | 45034 | |
| u | 45034 | |
| n | 30068 | |
| 30000 | 6.2% | |
| m | 30000 | 6.2% |
| o | 30000 | 6.2% |
| C | 30000 | 6.2% |
| Other values (5) | 75034 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15999292 |
| Value | Count | Frequency (%) |
| (unknown) | 480204 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 2000000 | |
| s | 2000000 | |
| u | 1499882 | |
| r | 1499882 | |
| t | 1499882 | |
| C | 1000000 | |
| 1000000 | ||
| o | 1000000 | |
| m | 1000000 | |
| n | 999764 | 6.2% |
| Other values (5) | 2499882 |
| Value | Count | Frequency (%) |
| e | 60000 | |
| s | 60000 | |
| t | 45034 | |
| r | 45034 | |
| u | 45034 | |
| n | 30068 | |
| 30000 | 6.2% | |
| m | 30000 | 6.2% |
| o | 30000 | 6.2% |
| C | 30000 | 6.2% |
| Other values (5) | 75034 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15999292 |
| Value | Count | Frequency (%) |
| (unknown) | 480204 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 2000000 | |
| s | 2000000 | |
| u | 1499882 | |
| r | 1499882 | |
| t | 1499882 | |
| C | 1000000 | |
| 1000000 | ||
| o | 1000000 | |
| m | 1000000 | |
| n | 999764 | 6.2% |
| Other values (5) | 2499882 |
| Value | Count | Frequency (%) |
| e | 60000 | |
| s | 60000 | |
| t | 45034 | |
| r | 45034 | |
| u | 45034 | |
| n | 30068 | |
| 30000 | 6.2% | |
| m | 30000 | 6.2% |
| o | 30000 | 6.2% |
| C | 30000 | 6.2% |
| Other values (5) | 75034 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15999292 |
| Value | Count | Frequency (%) |
| (unknown) | 480204 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 2000000 | |
| s | 2000000 | |
| u | 1499882 | |
| r | 1499882 | |
| t | 1499882 | |
| C | 1000000 | |
| 1000000 | ||
| o | 1000000 | |
| m | 1000000 | |
| n | 999764 | 6.2% |
| Other values (5) | 2499882 |
| Value | Count | Frequency (%) |
| e | 60000 | |
| s | 60000 | |
| t | 45034 | |
| r | 45034 | |
| u | 45034 | |
| n | 30068 | |
| 30000 | 6.2% | |
| m | 30000 | 6.2% |
| o | 30000 | 6.2% |
| C | 30000 | 6.2% |
| Other values (5) | 75034 |
customer_zip_code
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 89999 | 25539 |
| Distinct (%) | 9.0% | 85.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 54993.64477 | 54854.25793 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10000 | 10003 |
| Maximum | 99998 | 99985 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10000 | 10003 |
| 5-th percentile | 14491 | 14581.95 |
| Q1 | 32477.75 | 32239 |
| median | 54966 | 54559.5 |
| Q3 | 77493 | 77304.75 |
| 95-th percentile | 95497 | 95450.1 |
| Maximum | 99998 | 99985 |
| Range | 89998 | 89982 |
| Interquartile range (IQR) | 45015.25 | 45065.75 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 25975.8078 | 25945.28587 |
| Coefficient of variation (CV) | 0.4723419934 | 0.4729858145 |
| Kurtosis | -1.199859176 | -1.203477455 |
| Mean | 54993.64477 | 54854.25793 |
| Median Absolute Deviation (MAD) | 22509 | 22515.5 |
| Skewness | 0.00079246458 | 0.01518527042 |
| Sum | 5.499364477 × 1010 | 1645627738 |
| Variance | 674742590.8 | 673157858.7 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 41138 | 27 | < 0.1% |
| 28225 | 27 | < 0.1% |
| 19719 | 27 | < 0.1% |
| 25427 | 27 | < 0.1% |
| 95120 | 26 | < 0.1% |
| 38515 | 26 | < 0.1% |
| 54735 | 26 | < 0.1% |
| 21109 | 26 | < 0.1% |
| 17611 | 25 | < 0.1% |
| 82394 | 25 | < 0.1% |
| Other values (89989) | 999738 |
| Value | Count | Frequency (%) |
| 98876 | 5 | < 0.1% |
| 41905 | 5 | < 0.1% |
| 47419 | 5 | < 0.1% |
| 31912 | 5 | < 0.1% |
| 35444 | 5 | < 0.1% |
| 99244 | 4 | < 0.1% |
| 36854 | 4 | < 0.1% |
| 72876 | 4 | < 0.1% |
| 20323 | 4 | < 0.1% |
| 13575 | 4 | < 0.1% |
| Other values (25529) | 29955 |
| Value | Count | Frequency (%) |
| 10000 | 12 | |
| 10001 | 14 | |
| 10002 | 6 | |
| 10003 | 12 | |
| 10004 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 10003 | 2 | |
| 10007 | 1 | |
| 10009 | 1 | |
| 10011 | 1 | |
| 10012 | 2 |
| Value | Count | Frequency (%) |
| 10003 | 2 | |
| 10007 | 1 | |
| 10009 | 1 | |
| 10011 | 1 | |
| 10012 | 2 |
| Value | Count | Frequency (%) |
| 10000 | 12 | |
| 10001 | 14 | |
| 10002 | 6 | |
| 10003 | 12 | |
| 10004 | 5 | < 0.1% |
customer_city
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 6 | 6 |
| Min length | 6 | 6 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | City D | City A |
| 2nd row | City A | City A |
| 3rd row | City B | City D |
| 4th row | City A | City D |
| 5th row | City B | City D |
| Value | Count | Frequency (%) |
| city | 1000000 | |
| b | 250788 | 12.5% |
| c | 249955 | 12.5% |
| a | 249698 | 12.5% |
| d | 249559 | 12.5% |
| Value | Count | Frequency (%) |
| city | 30000 | |
| c | 7608 | 12.7% |
| a | 7531 | 12.6% |
| b | 7517 | 12.5% |
| d | 7344 | 12.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1249955 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| B | 250788 | 4.2% |
| A | 249698 | 4.2% |
| D | 249559 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37608 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7531 | 4.2% |
| B | 7517 | 4.2% |
| D | 7344 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 1249955 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| B | 250788 | 4.2% |
| A | 249698 | 4.2% |
| D | 249559 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37608 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7531 | 4.2% |
| B | 7517 | 4.2% |
| D | 7344 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 1249955 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| B | 250788 | 4.2% |
| A | 249698 | 4.2% |
| D | 249559 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37608 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7531 | 4.2% |
| B | 7517 | 4.2% |
| D | 7344 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 1249955 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| B | 250788 | 4.2% |
| A | 249698 | 4.2% |
| D | 249559 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37608 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7531 | 4.2% |
| B | 7517 | 4.2% |
| D | 7344 | 4.1% |
customer_state
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 7 | 7 |
| Mean length | 7 | 7 |
| Min length | 7 | 7 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | State Y | State Y |
| 2nd row | State X | State Z |
| 3rd row | State X | State Z |
| 4th row | State Y | State Y |
| 5th row | State Z | State Z |
| Value | Count | Frequency (%) |
| state | 1000000 | |
| z | 333674 | 16.7% |
| x | 333196 | 16.7% |
| y | 333130 | 16.7% |
| Value | Count | Frequency (%) |
| state | 30000 | |
| x | 10125 | 16.9% |
| z | 10078 | 16.8% |
| y | 9797 | 16.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| Z | 333674 | 4.8% |
| X | 333196 | 4.8% |
| Y | 333130 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| X | 10125 | 4.8% |
| Z | 10078 | 4.8% |
| Y | 9797 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| Z | 333674 | 4.8% |
| X | 333196 | 4.8% |
| Y | 333130 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| X | 10125 | 4.8% |
| Z | 10078 | 4.8% |
| Y | 9797 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| Z | 333674 | 4.8% |
| X | 333196 | 4.8% |
| Y | 333130 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| X | 10125 | 4.8% |
| Z | 10078 | 4.8% |
| Y | 9797 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| Z | 333674 | 4.8% |
| X | 333196 | 4.8% |
| Y | 333130 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| X | 10125 | 4.8% |
| Z | 10078 | 4.8% |
| Y | 9797 | 4.7% |
store_zip_code
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 89999 | 25570 |
| Distinct (%) | 9.0% | 85.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 54972.76671 | 54951.9952 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10000 | 10000 |
| Maximum | 99998 | 99994 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 10000 | 10000 |
| 5-th percentile | 14488 | 14347.8 |
| Q1 | 32473 | 32330.25 |
| median | 54961 | 55001 |
| Q3 | 77451 | 77483.25 |
| 95-th percentile | 95470 | 95507.15 |
| Maximum | 99998 | 99994 |
| Range | 89998 | 89994 |
| Interquartile range (IQR) | 44978 | 45153 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 25981.48314 | 26055.24544 |
| Coefficient of variation (CV) | 0.4726246229 | 0.4741455765 |
| Kurtosis | -1.200166165 | -1.2041705 |
| Mean | 54972.76671 | 54951.9952 |
| Median Absolute Deviation (MAD) | 22489.5 | 22594 |
| Skewness | -0.0001039626203 | -0.001870781196 |
| Sum | 5.497276671 × 1010 | 1648559856 |
| Variance | 675037466.1 | 678875815 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 20956 | 28 | < 0.1% |
| 29159 | 27 | < 0.1% |
| 59836 | 26 | < 0.1% |
| 54696 | 26 | < 0.1% |
| 92910 | 26 | < 0.1% |
| 43369 | 26 | < 0.1% |
| 26386 | 26 | < 0.1% |
| 90024 | 26 | < 0.1% |
| 27134 | 26 | < 0.1% |
| 20477 | 26 | < 0.1% |
| Other values (89989) | 999737 |
| Value | Count | Frequency (%) |
| 72397 | 5 | < 0.1% |
| 65367 | 5 | < 0.1% |
| 74371 | 5 | < 0.1% |
| 18725 | 4 | < 0.1% |
| 96681 | 4 | < 0.1% |
| 45189 | 4 | < 0.1% |
| 13087 | 4 | < 0.1% |
| 62639 | 4 | < 0.1% |
| 47243 | 4 | < 0.1% |
| 77335 | 4 | < 0.1% |
| Other values (25560) | 29957 |
| Value | Count | Frequency (%) |
| 10000 | 11 | |
| 10001 | 6 | < 0.1% |
| 10002 | 15 | |
| 10003 | 8 | |
| 10004 | 14 |
| Value | Count | Frequency (%) |
| 10000 | 1 | |
| 10005 | 2 | |
| 10006 | 1 | |
| 10007 | 1 | |
| 10008 | 1 |
| Value | Count | Frequency (%) |
| 10000 | 1 | |
| 10005 | 2 | |
| 10006 | 1 | |
| 10007 | 1 | |
| 10008 | 1 |
| Value | Count | Frequency (%) |
| 10000 | 11 | |
| 10001 | 6 | < 0.1% |
| 10002 | 15 | |
| 10003 | 8 | |
| 10004 | 14 |
store_city
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 6 | 6 |
| Min length | 6 | 6 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | City D | City A |
| 2nd row | City C | City C |
| 3rd row | City A | City C |
| 4th row | City B | City C |
| 5th row | City C | City A |
| Value | Count | Frequency (%) |
| city | 1000000 | |
| d | 250315 | 12.5% |
| c | 250177 | 12.5% |
| b | 249965 | 12.5% |
| a | 249543 | 12.5% |
| Value | Count | Frequency (%) |
| city | 30000 | |
| a | 7602 | 12.7% |
| d | 7567 | 12.6% |
| c | 7500 | 12.5% |
| b | 7331 | 12.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37500 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7602 | 4.2% |
| D | 7567 | 4.2% |
| B | 7331 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37500 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7602 | 4.2% |
| D | 7567 | 4.2% |
| B | 7331 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37500 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7602 | 4.2% |
| D | 7567 | 4.2% |
| B | 7331 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| 1000000 | ||
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37500 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| 30000 | ||
| A | 7602 | 4.2% |
| D | 7567 | 4.2% |
| B | 7331 | 4.1% |
store_state
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 7 | 7 |
| Mean length | 7 | 7 |
| Min length | 7 | 7 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | State Y | State Y |
| 2nd row | State X | State Y |
| 3rd row | State Y | State X |
| 4th row | State Z | State Z |
| 5th row | State X | State Z |
| Value | Count | Frequency (%) |
| state | 1000000 | |
| x | 333702 | 16.7% |
| z | 333602 | 16.7% |
| y | 332696 | 16.6% |
| Value | Count | Frequency (%) |
| state | 30000 | |
| z | 10041 | 16.7% |
| x | 10009 | 16.7% |
| y | 9950 | 16.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| Z | 10041 | 4.8% |
| X | 10009 | 4.8% |
| Y | 9950 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| Z | 10041 | 4.8% |
| X | 10009 | 4.8% |
| Y | 9950 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| Z | 10041 | 4.8% |
| X | 10009 | 4.8% |
| Y | 9950 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| 1000000 | ||
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| 30000 | ||
| Z | 10041 | 4.8% |
| X | 10009 | 4.8% |
| Y | 9950 | 4.7% |
distance_to_store
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 10001 | 9505 |
| Distinct (%) | 1.0% | 31.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.97910924 | 49.60062667 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0.01 |
| Maximum | 100 | 100 |
| Zeros | 62 | 0 |
| Zeros (%) | < 0.1% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0.01 |
| 5-th percentile | 5.03 | 4.74 |
| Q1 | 24.97 | 24.55 |
| median | 49.96 | 49.55 |
| Q3 | 74.95 | 74.54 |
| 95-th percentile | 94.98 | 94.8805 |
| Maximum | 100 | 100 |
| Range | 100 | 99.99 |
| Interquartile range (IQR) | 49.98 | 49.99 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 28.86098911 | 28.88565901 |
| Coefficient of variation (CV) | 0.5774610543 | 0.5823647996 |
| Kurtosis | -1.200199633 | -1.19802704 |
| Mean | 49.97910924 | 49.60062667 |
| Median Absolute Deviation (MAD) | 24.99 | 25 |
| Skewness | 0.001218286468 | 0.01292473608 |
| Sum | 49979109.24 | 1488018.8 |
| Variance | 832.9566927 | 834.3812965 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 99.05 | 139 | < 0.1% |
| 0.01 | 138 | < 0.1% |
| 9.68 | 138 | < 0.1% |
| 30.79 | 136 | < 0.1% |
| 31.27 | 135 | < 0.1% |
| 22.61 | 134 | < 0.1% |
| 40.84 | 134 | < 0.1% |
| 78.41 | 134 | < 0.1% |
| 82.37 | 133 | < 0.1% |
| 89.9 | 133 | < 0.1% |
| Other values (9991) | 998646 |
| Value | Count | Frequency (%) |
| 68.54 | 11 | < 0.1% |
| 59.82 | 11 | < 0.1% |
| 93.82 | 11 | < 0.1% |
| 76.63 | 11 | < 0.1% |
| 15.31 | 10 | < 0.1% |
| 80.79 | 10 | < 0.1% |
| 81.05 | 10 | < 0.1% |
| 47.46 | 10 | < 0.1% |
| 13.25 | 10 | < 0.1% |
| 30.27 | 10 | < 0.1% |
| Other values (9495) | 29896 |
| Value | Count | Frequency (%) |
| 0 | 62 | |
| 0.01 | 138 | |
| 0.02 | 88 | |
| 0.03 | 113 | |
| 0.04 | 86 |
| Value | Count | Frequency (%) |
| 0.01 | 4 | |
| 0.02 | 2 | |
| 0.03 | 2 | |
| 0.04 | 2 | |
| 0.05 | 4 |
| Value | Count | Frequency (%) |
| 0.01 | 4 | |
| 0.02 | 2 | |
| 0.03 | 2 | |
| 0.04 | 2 | |
| 0.05 | 4 |
| Value | Count | Frequency (%) |
| 0 | 62 | |
| 0.01 | 138 | |
| 0.02 | 88 | |
| 0.03 | 113 | |
| 0.04 | 86 |
holiday_season
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 3 | 2 |
| Mean length | 2.500214 | 2.498866667 |
| Min length | 2 | 2 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | No | Yes |
| 2nd row | No | No |
| 3rd row | Yes | Yes |
| 4th row | Yes | Yes |
| 5th row | Yes | No |
| Value | Count | Frequency (%) |
| yes | 500214 | |
| no | 499786 |
| Value | Count | Frequency (%) |
| no | 15034 | |
| yes | 14966 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| N | 15034 | |
| o | 15034 | |
| Y | 14966 | |
| e | 14966 | |
| s | 14966 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2500214 |
| Value | Count | Frequency (%) |
| (unknown) | 74966 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| N | 15034 | |
| o | 15034 | |
| Y | 14966 | |
| e | 14966 | |
| s | 14966 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2500214 |
| Value | Count | Frequency (%) |
| (unknown) | 74966 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| N | 15034 | |
| o | 15034 | |
| Y | 14966 | |
| e | 14966 | |
| s | 14966 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2500214 |
| Value | Count | Frequency (%) |
| (unknown) | 74966 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| N | 15034 | |
| o | 15034 | |
| Y | 14966 | |
| e | 14966 | |
| s | 14966 |
season
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 5.500322 | 5.495533333 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Spring | Spring |
| 2nd row | Summer | Winter |
| 3rd row | Winter | Spring |
| 4th row | Winter | Winter |
| 5th row | Summer | Summer |
| Value | Count | Frequency (%) |
| winter | 250307 | |
| spring | 250169 | |
| fall | 249839 | |
| summer | 249685 |
| Value | Count | Frequency (%) |
| fall | 7567 | |
| spring | 7515 | |
| summer | 7499 | |
| winter | 7419 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22433 | |
| l | 15134 | |
| S | 15014 | |
| m | 14998 | |
| i | 14934 | |
| n | 14934 | |
| e | 14918 | |
| F | 7567 | 4.6% |
| a | 7567 | 4.6% |
| p | 7515 | 4.6% |
| Other values (4) | 29852 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5500322 |
| Value | Count | Frequency (%) |
| (unknown) | 164866 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22433 | |
| l | 15134 | |
| S | 15014 | |
| m | 14998 | |
| i | 14934 | |
| n | 14934 | |
| e | 14918 | |
| F | 7567 | 4.6% |
| a | 7567 | 4.6% |
| p | 7515 | 4.6% |
| Other values (4) | 29852 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5500322 |
| Value | Count | Frequency (%) |
| (unknown) | 164866 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22433 | |
| l | 15134 | |
| S | 15014 | |
| m | 14998 | |
| i | 14934 | |
| n | 14934 | |
| e | 14918 | |
| F | 7567 | 4.6% |
| a | 7567 | 4.6% |
| p | 7515 | 4.6% |
| Other values (4) | 29852 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5500322 |
| Value | Count | Frequency (%) |
| (unknown) | 164866 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22433 | |
| l | 15134 | |
| S | 15014 | |
| m | 14998 | |
| i | 14934 | |
| n | 14934 | |
| e | 14918 | |
| F | 7567 | 4.6% |
| a | 7567 | 4.6% |
| p | 7515 | 4.6% |
| Other values (4) | 29852 |
weekend
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 2 |
| Mean length | 2.499333 | 2.498233333 |
| Min length | 2 | 2 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | Yes | Yes |
| 2nd row | Yes | No |
| 3rd row | Yes | Yes |
| 4th row | No | No |
| 5th row | Yes | No |
| Value | Count | Frequency (%) |
| no | 500667 | |
| yes | 499333 |
| Value | Count | Frequency (%) |
| no | 15053 | |
| yes | 14947 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| N | 15053 | |
| o | 15053 | |
| Y | 14947 | |
| e | 14947 | |
| s | 14947 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499333 |
| Value | Count | Frequency (%) |
| (unknown) | 74947 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| N | 15053 | |
| o | 15053 | |
| Y | 14947 | |
| e | 14947 | |
| s | 14947 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499333 |
| Value | Count | Frequency (%) |
| (unknown) | 74947 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| N | 15053 | |
| o | 15053 | |
| Y | 14947 | |
| e | 14947 | |
| s | 14947 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499333 |
| Value | Count | Frequency (%) |
| (unknown) | 74947 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| N | 15053 | |
| o | 15053 | |
| Y | 14947 | |
| e | 14947 | |
| s | 14947 |
customer_support_calls
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 20 | 20 |
| Distinct (%) | < 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 9.496269 | 9.4736 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 19 | 19 |
| Zeros | 49755 | 1525 |
| Zeros (%) | 5.0% | 5.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 1 | 0 |
| Q1 | 4 | 4 |
| median | 9 | 9 |
| Q3 | 14 | 14 |
| 95-th percentile | 18 | 19 |
| Maximum | 19 | 19 |
| Range | 19 | 19 |
| Interquartile range (IQR) | 10 | 10 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 5.761232791 | 5.769172022 |
| Coefficient of variation (CV) | 0.606683824 | 0.608973571 |
| Kurtosis | -1.204539564 | -1.203435083 |
| Mean | 9.496269 | 9.4736 |
| Median Absolute Deviation (MAD) | 5 | 5 |
| Skewness | 0.001572025506 | 0.000146728086 |
| Sum | 9496269 | 284208 |
| Variance | 33.19180327 | 33.28334582 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 50608 | 5.1% |
| 8 | 50350 | 5.0% |
| 4 | 50334 | 5.0% |
| 12 | 50312 | 5.0% |
| 2 | 50158 | 5.0% |
| 11 | 50151 | 5.0% |
| 16 | 50087 | 5.0% |
| 13 | 50074 | 5.0% |
| 9 | 50053 | 5.0% |
| 7 | 50050 | 5.0% |
| Other values (10) | 497823 |
| Value | Count | Frequency (%) |
| 12 | 1606 | 5.4% |
| 2 | 1577 | 5.3% |
| 13 | 1543 | 5.1% |
| 9 | 1535 | 5.1% |
| 19 | 1531 | 5.1% |
| 0 | 1525 | 5.1% |
| 3 | 1517 | 5.1% |
| 17 | 1502 | 5.0% |
| 1 | 1501 | 5.0% |
| 7 | 1500 | 5.0% |
| Other values (10) | 14663 |
| Value | Count | Frequency (%) |
| 0 | 49755 | |
| 1 | 49530 | |
| 2 | 50158 | |
| 3 | 50608 | |
| 4 | 50334 |
| Value | Count | Frequency (%) |
| 0 | 1525 | |
| 1 | 1501 | |
| 2 | 1577 | |
| 3 | 1517 | |
| 4 | 1450 |
| Value | Count | Frequency (%) |
| 0 | 1525 | |
| 1 | 1501 | |
| 2 | 1577 | |
| 3 | 1517 | |
| 4 | 1450 |
| Value | Count | Frequency (%) |
| 0 | 49755 | |
| 1 | 49530 | |
| 2 | 50158 | |
| 3 | 50608 | |
| 4 | 50334 |
email_subscriptions
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 2 |
| Mean length | 2.499938 | 2.498733333 |
| Min length | 2 | 2 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | No | No |
| 2nd row | No | Yes |
| 3rd row | Yes | No |
| 4th row | No | Yes |
| 5th row | No | Yes |
| Value | Count | Frequency (%) |
| no | 500062 | |
| yes | 499938 |
| Value | Count | Frequency (%) |
| no | 15038 | |
| yes | 14962 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| N | 15038 | |
| o | 15038 | |
| Y | 14962 | |
| e | 14962 | |
| s | 14962 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499938 |
| Value | Count | Frequency (%) |
| (unknown) | 74962 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| N | 15038 | |
| o | 15038 | |
| Y | 14962 | |
| e | 14962 | |
| s | 14962 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499938 |
| Value | Count | Frequency (%) |
| (unknown) | 74962 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| N | 15038 | |
| o | 15038 | |
| Y | 14962 | |
| e | 14962 | |
| s | 14962 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499938 |
| Value | Count | Frequency (%) |
| (unknown) | 74962 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| N | 15038 | |
| o | 15038 | |
| Y | 14962 | |
| e | 14962 | |
| s | 14962 |
app_usage
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.334299 | 4.3237 |
| Min length | 3 | 3 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | High | Medium |
| 2nd row | High | Medium |
| 3rd row | Low | Medium |
| 4th row | Low | Medium |
| 5th row | Medium | Low |
| Value | Count | Frequency (%) |
| medium | 333822 | |
| low | 333345 | |
| high | 332833 |
| Value | Count | Frequency (%) |
| low | 10143 | |
| high | 9930 | |
| medium | 9927 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19857 | |
| L | 10143 | |
| w | 10143 | |
| o | 10143 | |
| H | 9930 | |
| g | 9930 | |
| h | 9930 | |
| M | 9927 | |
| e | 9927 | |
| d | 9927 | |
| Other values (2) | 19854 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4334299 |
| Value | Count | Frequency (%) |
| (unknown) | 129711 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19857 | |
| L | 10143 | |
| w | 10143 | |
| o | 10143 | |
| H | 9930 | |
| g | 9930 | |
| h | 9930 | |
| M | 9927 | |
| e | 9927 | |
| d | 9927 | |
| Other values (2) | 19854 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4334299 |
| Value | Count | Frequency (%) |
| (unknown) | 129711 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19857 | |
| L | 10143 | |
| w | 10143 | |
| o | 10143 | |
| H | 9930 | |
| g | 9930 | |
| h | 9930 | |
| M | 9927 | |
| e | 9927 | |
| d | 9927 | |
| Other values (2) | 19854 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4334299 |
| Value | Count | Frequency (%) |
| (unknown) | 129711 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19857 | |
| L | 10143 | |
| w | 10143 | |
| o | 10143 | |
| H | 9930 | |
| g | 9930 | |
| h | 9930 | |
| M | 9927 | |
| e | 9927 | |
| d | 9927 | |
| Other values (2) | 19854 |
website_visits
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.512951 | 49.5615 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 10111 | 293 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 4 | 4 |
| Q1 | 25 | 25 |
| median | 50 | 50 |
| Q3 | 75 | 75 |
| 95-th percentile | 95 | 94 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 50 | 50 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 28.86977699 | 28.77939123 |
| Coefficient of variation (CV) | 0.5830752644 | 0.5806803916 |
| Kurtosis | -1.199464505 | -1.192315994 |
| Mean | 49.512951 | 49.5615 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | -0.0006306812576 | -0.00604114859 |
| Sum | 49512951 | 1486845 |
| Variance | 833.4640237 | 828.2533595 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 58 | 10304 | 1.0% |
| 95 | 10250 | 1.0% |
| 50 | 10235 | 1.0% |
| 62 | 10177 | 1.0% |
| 45 | 10175 | 1.0% |
| 13 | 10166 | 1.0% |
| 38 | 10160 | 1.0% |
| 84 | 10147 | 1.0% |
| 98 | 10136 | 1.0% |
| 93 | 10132 | 1.0% |
| Other values (90) | 898118 |
| Value | Count | Frequency (%) |
| 38 | 331 | 1.1% |
| 52 | 329 | 1.1% |
| 56 | 328 | 1.1% |
| 45 | 328 | 1.1% |
| 53 | 326 | 1.1% |
| 14 | 324 | 1.1% |
| 58 | 324 | 1.1% |
| 3 | 323 | 1.1% |
| 42 | 323 | 1.1% |
| 96 | 322 | 1.1% |
| Other values (90) | 26742 |
| Value | Count | Frequency (%) |
| 0 | 10111 | |
| 1 | 9997 | |
| 2 | 9933 | |
| 3 | 10007 | |
| 4 | 9969 |
| Value | Count | Frequency (%) |
| 0 | 293 | |
| 1 | 304 | |
| 2 | 301 | |
| 3 | 323 | |
| 4 | 280 |
| Value | Count | Frequency (%) |
| 0 | 293 | |
| 1 | 304 | |
| 2 | 301 | |
| 3 | 323 | |
| 4 | 280 |
| Value | Count | Frequency (%) |
| 0 | 10111 | |
| 1 | 9997 | |
| 2 | 9933 | |
| 3 | 10007 | |
| 4 | 9969 |
social_media_engagement
['Text', 'Text']
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Stratified Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.332057 | 4.325733333 |
| Min length | 3 | 3 |
Unique
| Full Dataset | Stratified Sample | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Stratified Sample | |
|---|---|---|
| 1st row | High | Medium |
| 2nd row | Medium | Medium |
| 3rd row | Medium | Medium |
| 4th row | Low | High |
| 5th row | Low | Medium |
| Value | Count | Frequency (%) |
| low | 334073 | |
| medium | 333065 | |
| high | 332862 |
| Value | Count | Frequency (%) |
| low | 10084 | |
| high | 9988 | |
| medium | 9928 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 19916 | |
| L | 10084 | |
| w | 10084 | |
| o | 10084 | |
| H | 9988 | |
| g | 9988 | |
| h | 9988 | |
| M | 9928 | |
| e | 9928 | |
| d | 9928 | |
| Other values (2) | 19856 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4332057 |
| Value | Count | Frequency (%) |
| (unknown) | 129772 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 19916 | |
| L | 10084 | |
| w | 10084 | |
| o | 10084 | |
| H | 9988 | |
| g | 9988 | |
| h | 9988 | |
| M | 9928 | |
| e | 9928 | |
| d | 9928 | |
| Other values (2) | 19856 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4332057 |
| Value | Count | Frequency (%) |
| (unknown) | 129772 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 19916 | |
| L | 10084 | |
| w | 10084 | |
| o | 10084 | |
| H | 9988 | |
| g | 9988 | |
| h | 9988 | |
| M | 9928 | |
| e | 9928 | |
| d | 9928 | |
| Other values (2) | 19856 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4332057 |
| Value | Count | Frequency (%) |
| (unknown) | 129772 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 19916 | |
| L | 10084 | |
| w | 10084 | |
| o | 10084 | |
| H | 9988 | |
| g | 9988 | |
| h | 9988 | |
| M | 9928 | |
| e | 9928 | |
| d | 9928 | |
| Other values (2) | 19856 |
days_since_last_purchase
Real number (ℝ)
| Full Dataset | Stratified Sample | |
|---|---|---|
| Distinct | 365 | 365 |
| Distinct (%) | < 0.1% | 1.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 182.027559 | 182.3302 |
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 364 | 364 |
| Zeros | 2768 | 85 |
| Zeros (%) | 0.3% | 0.3% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 18 | 19 |
| Q1 | 91 | 92 |
| median | 182 | 182 |
| Q3 | 273 | 274 |
| 95-th percentile | 346 | 346 |
| Maximum | 364 | 364 |
| Range | 364 | 364 |
| Interquartile range (IQR) | 182 | 182 |
Descriptive statistics
| Full Dataset | Stratified Sample | |
|---|---|---|
| Standard deviation | 105.3645979 | 105.2194266 |
| Coefficient of variation (CV) | 0.5788387123 | 0.5770817266 |
| Kurtosis | -1.199912738 | -1.196932124 |
| Mean | 182.027559 | 182.3302 |
| Median Absolute Deviation (MAD) | 91 | 91 |
| Skewness | -0.0005543132091 | 0.0001539046339 |
| Sum | 182027559 | 5469906 |
| Variance | 11101.69848 | 11071.12774 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 53 | 2916 | 0.3% |
| 72 | 2890 | 0.3% |
| 98 | 2888 | 0.3% |
| 252 | 2869 | 0.3% |
| 364 | 2867 | 0.3% |
| 6 | 2862 | 0.3% |
| 325 | 2857 | 0.3% |
| 136 | 2843 | 0.3% |
| 267 | 2833 | 0.3% |
| 239 | 2832 | 0.3% |
| Other values (355) | 971343 |
| Value | Count | Frequency (%) |
| 114 | 107 | 0.4% |
| 26 | 106 | 0.4% |
| 49 | 103 | 0.3% |
| 163 | 103 | 0.3% |
| 216 | 101 | 0.3% |
| 203 | 101 | 0.3% |
| 329 | 100 | 0.3% |
| 109 | 100 | 0.3% |
| 296 | 99 | 0.3% |
| 337 | 99 | 0.3% |
| Other values (355) | 28981 |
| Value | Count | Frequency (%) |
| 0 | 2768 | |
| 1 | 2752 | |
| 2 | 2701 | |
| 3 | 2709 | |
| 4 | 2786 |
| Value | Count | Frequency (%) |
| 0 | 85 | |
| 1 | 63 | |
| 2 | 83 | |
| 3 | 85 | |
| 4 | 79 |
| Value | Count | Frequency (%) |
| 0 | 85 | |
| 1 | 63 | |
| 2 | 83 | |
| 3 | 85 | |
| 4 | 79 |
| Value | Count | Frequency (%) |
| 0 | 2768 | |
| 1 | 2752 | |
| 2 | 2701 | |
| 3 | 2709 | |
| 4 | 2786 |